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When considering a graphical Gaussian model Ng Markov with 
respect to a decomposable graph G, the parameter space of interest 
for the precision parameter is the cone Pg of positive definite ma- 
trices with fixed zeros corresponding to the missing edges of G. The 
parameter space for the scale parameter of Mg is the cone Qg, dual 
to Pg , of incomplete matrices with submatrices corresponding to the 
cliques of G being positive definite. In this paper we construct on 
the cones Qg and Pg two families of Wishart distributions, namely 
the Type I and Type II Wisharts. They can be viewed as general- 
izations of the hyper Wishart and the inverse of the hyper inverse 
Wishart as defined by Dawid and Lauritzen [Ann. Statist. 21 (1993) 
1272-1317]. We show that the Type I and II Wisharts have proper- 
ties similar to those of the hyper and hyper inverse Wishart. Indeed, 
the inverse of the Type II Wishart forms a conjugate family of priors 
for the covariance parameter of the graphical Gaussian model and is 
strong directed hyper Markov for every direction given to the graph 
by a perfect order of its cliques, while the Type I Wishart is weak 
hyper Markov. Moreover, the inverse Type II Wishart as a conjugate 
family presents the advantage of having a multidimensional shape 
parameter, thus offering flexibility for the choice of a prior. 

Both Type I and II Wishart distributions depend on multivariate 
shape parameters. A shape parameter is acceptable if and only if it 
satisfies a certain eigenvalue property. We show that the sets of ac- 
ceptable shape parameters for a noncomplete G have dimension equal 
to at least one plus the number of cliques in G. These families, as 
conjugate families, are richer than the traditional Diaconis-Ylvisaker 
conjugate families which all have a shape parameter set of dimension 
one. A decomposable graph which does not contain a three-link chain 
as an induced subgraph is said to be homogeneous. In this case, our 
Wisharts are particular cases of the Wisharts on homogeneous cones 
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as defined by Andersson and Wojnar [J. Theoret. Probab. 17 (2004) 
781-818] and the dimension of the shape parameter set is even larger 
than in the nonhomogeneous case: it is indeed equal to the number 
of cliques plus the number of distinct minimal separators. Using the 
model where G is a three-link chain, we show by computing a 7-tuple 
integral that in general we cannot expect the shape parameter sets 
to have dimension larger than the number of cliques plus one. 

1. Introduction. The primary aim of this paper is to develop a new 
family of conjugate prior distributions with attractive Markov properties 
for the covariance parameter, or equivalently the precision parameter, of 
graphical Gaussian models Markov with respect to a decomposable graph G. 
While doing so, we are led to define two new classes of Wishart distributions 
and their inverses and to study their properties. 

Let us recall that an undirected graph is a pair (V, £) where V = {1, . . . ,r} 
and £ is a family of subsets of V of size 2. It will be convenient to 

consider the set E C V x V of (i, j) such that either i = j or {i, j} is in £, 
rather than £ and, since E and £ carry the same information, to speak about 
the graph G = (V, E). Any such that i ^ j will be called an edge. An r- 
dimensional Gaussian model is said to be Markov with respect to G if for any 
edge not in E, the ith and jth variables are conditionally independent 
given all the other variables. Such models are known as covariance selection 
models (see [8]) or graphical Gaussian models (see [18] or [11]). Without loss 
of generality, we can assume that these models are centered JV r (0, £), and 
it is well known that they are characterized by the parameter set Pq of the 
precision matrices, which is the set of positive definite matrices K = X" 1 
such that Kij = whenever the edge is not in E. Equivalently, if we 
denote by M the linear space of symmetric matrices of order r, by M+ C M 
the cone of positive definite (abbreviated > 0) matrices, by Iq the linear 
space of symmetric incomplete matrices x with missing entries Xij, ^ E, 
and by ir : M i— > Iq the projection of M into Iq, the parameter set of the 
Gaussian model can be described as the set of incomplete matrices Eg = 
7r(E) with £ = K" 1 and K S Pq. Indeed it is easy to verify that the entries 
Si,-, E are such that 

and are therefore not free parameters of the Gaussian models. One can prove 
that the correspondence between K and the incomplete matrix T,q = tt(X!) 
is one to one. We write Eg = <p{K) = t:(K~ 1 ). We note that ip is not explicit 
when G is not decomposable. 

Henceforth in this paper, we will assume that G is decomposable. The 
reader is referred to [11] for all the common notions of graphical models 
used in this paper. We will now simply recall some basic facts and tradi- 
tional notation we will use throughout this paper. Every decomposable graph 
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admits a perfect order of its cliques. Let (Ci, . . . , Ck) be such an order. We 
use the notation Hi = R\ = C%, while for j = 2, . . . , k we write 

Hj = C\ U ■ • • U Cj, Rj = Cj \ Hj—i, Sj = n Cj. 

The Sj,j = 2,...,k, are the minimal separators of G. Some of these sep- 
arators can be identical. We let k' < k — 1 denote the number of distinct 
separators and v(S) denote the multiplicity of S that is the number of j 
such that Sj = S. Lauritzen [11] has proven that the multiplicity v(S) of a 
given minimal separator S is positive and independent of the perfect order 
of the cliques considered. 

For G given decomposable with the set of cliques {Ci, . . . , Ck) and E -1 € 
Pg, the incomplete matrix Eg is completely determined by its submatri- 
ces {Ec i , i = 1, . . . , k} where, of course, for each i = I, . . . ,k, is positive 
definite. When considering the parameter space of the graphical Gaussian 
model corresponding to G? decomposable, we are therefore led to consider 
the two cones 

(1.1) P G = {y e M+\ yij = 0, i E}, 

(1.2) Q G = {xeI G \x Ci >0,i = l,...,k}. 

Dawid and Lauritzen ( [7] , Section 7) defined two distributions on Qg , namely, 
the hyper Wishart distribution as the distribution of the maximum likeli- 
hood estimator of Eg, and the hyper inverse Wishart distribution as the 
Diaconis-Ylvisaker conjugate prior distribution for Egi. Subsequently Rover- 
ato [16] gave the distribution of K = E _1 = (^^(Eg) when Eg follows the 
hyper inverse Wishart distribution. We will call this distribution of K on 
Pg the G- Wishart. The search for a rich and flexible class of conjugate prior 
distributions for Eg, or equivalently for K = E _1 , remains a topic of high 
interest to statisticians. 

When G is complete, Pg = Qg = and we define the regular Wishart 
distribution on the cone of positive definite matrices of dimension r = \ V\ 
by 

-(l/2)tr( :E E-l)| x |p| a; |-(r + l)/2 lM+(a;)dX) 



2 r PT r {p)\E\P 



where p > is the one-dimensional shape parameter and E £ is the 
scale parameter. 

When G is decomposable, the hyper and hyper inverse Wisharts have been 
constructed as a Markov combination (with respect to G) of the Wishart 
and its inverse, respectively, and so, like the Wishart, they have a one- 
dimensional shape parameter and a scale parameter in Qg- Dawid and Lau- 
ritzen [7] have shown that these distributions have Markov properties: the 
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hyper Wishart is weak hyper Markov while the hyper inverse Wishart is 
strong hyper Markov. 

In this paper, we will construct a family of distributions, called Type 
I Wisharts, defined on Q G , and another family, called Type II Wisharts, 
defined on Pq. We shall see in Section 4 that the inverses of the Type II 
distributions, like the hyper inverse Wisharts, form a family of conjugate 
prior distributions for the scale parameter of the graphical Gaussian model. 
We will also show that they are strong directed hyper Markov in the direction 
given to the graph G by any choice of a perfect numbering of its vertices. This 
property is parallel to the strong hyper Markov property of the hyper inverse 
Wishart. We will also show that the Type I Wishart is weak hyper Markov, a 
property parallel to the weak hyper Markov property of the hyper Wishart. 
The attractive feature of the inverse Type II Wishart family of conjugate 
distributions is that, except in the trivial case where G is complete, the set 
of shape parameters is of dimension strictly greater than the number k of 
cliques in G, thus offering a flexible class of conjugate prior distributions 
for Yiq. We shall also note in Section 4 that it forms a class of enriched 
conjugate priors for Hq in the sense of [5]. 

To construct these two families, we define two natural exponential families 
of distributions affiliated with the Wishart, one on Q G and one on Pq. Let 
(Ci, . . . , Ck) denote a perfect order of the cliques of G and let (52, . . . , Sk) 
be its corresponding sequence of minimal separators, some of them being 
possibly identical. We consider functions of the type 

Y\ k Wn l a » 

H G (a, (3; x) = C ' , x € Q G , 

where a and (3 are two real-valued functions on the collections C and S of 
cliques and separators, respectively, such that a(Q) = cti,P(Sj) = Pj with 
Pi = Pj if Si = Sj . These functions play a very special role in the definition 
of the two families of distributions that we define. Indeed, if we let Cj = |Cj| 
and Si = | | denote the cardinality of Cj and Si, respectively, and if we 
denote 

n =i \xc H ci+i )/ 2 

»G{dx) = I ■ l Qc (x)dx, 

l li=2 I x Ai I 

the family of distributions we define on Qq is, for a given (a, (3), the natural 
exponential family generated by 

(1.3) H G (a, P; x)fi G (dx) = H G (a - ±(c + 1), - |(s + 1); x)l Qa (x) dx. 

The measure (1.3) can be seen as a Markov combination generalization of 
the measure \x\ p \x\~( r+1 ^ 2 l M +(x) dx generating the Wishart distribution, 
for a given p > • 
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In Section 3 we will introduce the set A of (ot,/3) such that the following 
integral converges and satisfies 

(1.4) f e-^Hda, (3; x)fi G (dx) = T^a, (3)H G (a, (3; <p(y)), 
JQg 

where T\{a, [3) is some function of (a, (3) independent of y £ Pq- When (a, (3) 
is in A we say that Hc(ct, (3; x) has the eigenvalue property with correspond- 
ing eigenvalue Ti(a, [3) and we define the Type I Wishart distribution on Qg 
as the distribution with density 

— - — — ^— ——e-^Haia, (3; x)^(dx) 
Ti(a,f3)H G (a,(3;ip{y)) 

and with parameters (a, f3, y). In a parallel way, we define a set B of (a, (3) for 
which an eigenvalue property similar to (1.4) holds for the Type II Wishart 
distribution defined on Pa- 
in order to fully describe the Type I and II Wishart distributions, it is then 
necessary to know the sets A and B. In Section 3.2 we show that, for any G, 
the hyper Wishart and the G- Wishart are particular cases of Type I and II 
distributions, respectively. More precisely, we describe the sets A\ C A and 
B± C B such that for (a, (3) € Ai, the Type I Wishart is the hyper Wishart 
and for (a,{3) G B±, the Type II Wishart is the G- Wishart. In Section 3.3 
we consider the particular class of decomposable graphs G which do not 
contain the three-link chain, which we call Aj, as an induced subgraph. 
Such graphs are called homogeneous. When G? is homogeneous, we describe 
the sets A and B completely and show that they are open sets of dimension 
k + k' , the number of cliques plus the number of distinct separators in G. 
For G homogeneous, the cones Qg and Pq are homogeneous and we see 
that the Type I and II Wisharts then belong to the class of Wisharts on 
homogeneous cones defined by Andersson and Wojnar [3]. In Section 3.4 we 
consider nonhomogeneous graphs. In that case, we have, so far, only partial 
knowledge of A and B. For each perfect order P of the cliques, we define a 
(k + l)-dimensional subset Ap of A such that, for (a, (3) € Ap, (1.4) holds. 
We therefore know the subset U^P °f A, but not all of A. Similarly we 
define a (k + l)-dimensional subset Bp of B such that we know the subset 
U Bp of B, but not all of B. We conjecture that the equalities A = [j Ap and 
B = [J Bp hold in general for nonhomogeneous graphs and that, thus, the 
dimension of the manifolds A and B is generally k + 1 < k + kf . In Section 
3.4 we verify that these two equalities hold when G = A^ by computing, in 
this case, the 7-tuple integral corresponding to (1.4). 

In Section 4 we give the conjugacy and hyper Markov properties men- 
tioned above. We also give the Laplace transforms of the Type I and II 
Wisharts and the expected values of the Type I, Type II and inverse Type 
II Wisharts. The necessary preliminaries for understanding the cones Pq 



G 



G. LETAC AND H. MASSAM 



and Qg and the measures we define on them are given in Section 2.1. In 
Section 2.2 we give the results needed to work with homogeneous graphs. 
Most proofs are deferred to the Appendix. 

2. Preliminaries. 

2.1. Measures on Pq and Qg- For the graph G = (V,E), V = {1, . . . , r}, 
we write i ~ j to indicate that the edge {i, j} is in £. An undirected graph 
G is said to be decomposable if it does not contain a cycle of length greater 
than or equal to four as an induced subgraph and if it is connected. For all 
the notions related to decomposable graphs that we will introduce below, 
the reader is referred to [11], Chapter 2. We denote by Zq the real linear 
space of symmetric matrices y of order r such that yij = if £ E. We 
denote by Iq the real linear space of functions *— * x^j from E to R 
such that The elements of Iq are called G-incomplete symmetric 

matrices. For a decomposable graph, we have defined the cones Pq C Zq 
and Qg C Ig i n (1-1) and (1-2). Recall that M+ denotes the cone of positive 
definite symmetric matrices of order r. Grone et al. [10] proved the following. 

Proposition 2.1. When G is decomposable, for any x in Qg there 
exists a unique x in M+ such that for all in E we have Xij = Xij and 

such that x -1 is in Pq. 

This defines a bijection between Pq and Qg, 



where ir denotes the projection of M onto Iq- The explicit expression of 
x~ l is given in (2.3) below. For (x,y) G Iq x Zq, we write tr(xy) = (x,y) = 
J2(i,j)<=E x ijUij- By Proposition 2.1, we have for x € Qg {x, y) = tr(xy), where 
tr(xy) is defined in the classical way. Thus although xy does not make sense, 
the notation tr(xy) is quite convenient. We also use the following notation: 
if C is a complete subset of vertices and if xc = (^ij)jj'ec is a matrix, we 
denote by (xc)° = (xjj)j je y the matrix such that x^ = for C x C. 

The following theorem gathers some basic results on decomposable graphs. 
Part 1 is due to Andersson [2], parts 2 and 3 can be found in [11], Chapter 
5 and part 4 is due to Roverato [16]. 

Theorem 2.1. Let G be a decomposable graph. Then: 

1. The convex open cones Pq and Qg are dual to each other in the sense 
that 



tp:y=(x) 1 G P G i-> x = tp(y) = ir(y 1 ) G Qg 



(2.1) 
(2.2) 



Pg = {y e Z G ; tr(xy) > Vx G Q G \ {0}} 
Q G = {x G I G ; tr(xy) > Vy G P~g \ {0}}. 
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2. For x G Qg we have that y = x~ 1 is in Pq and 

(2.3) y=Y,^c 1 )°-J2^ S )( x s 1 )°- 

3. For x G Qg we have 

det£ = TT n ^ (detX ^ . 
n 5G5 (detx s )^ s ) 

4. T/ie absolute value of the Jacobian of the bijection x h- ► y = x" 1 /rom 
Q G to P G is 

(2.4) J] (detx c )" |Chl I] (detx s )d 5 l +1 M 5 ). 
cec ses 

The proof of part 1 is given in the Appendix. For G complete, part 4 
above becomes the following. 

Lemma 2.1 ([14]). The Jacobian of the change of variable x G i— > 
y = x~ l G M+ is \y\~ { - r+1) . 

We now introduce the measures which will be the generating measures of 
the new Wishart exponential families on Pg and Qg that we are going to 
define in the next section. Let a and (3 be two real valued functions on C 
and 5, respectively. An example of such functions a and (3 is 

C eC^a(C) = \C\ and S G S ^ p(S) = \S\. 

We denote these examples a = c and (3 = s. Another example is, for a con- 
stant p given, 

C eC^a(C)=p and S G S /?(S) = p, 
simply denoted a = p and = p. For x G we adopt the notation 

The functions i?G for the particular case a = — |(c + 1) and /? = — |(s + 1) 
will play an important role. Indeed, we will use the following as reference 
measures to generate the exponential families of distributions which are the 
central object of our study in this paper. These reference measures are 

(2.6) fi G (dx) = H G {-\{c + 1), -\{s + 1);x)1 Qg (x) dx, 

(2.7) u G {dy) = H G (\{c+l), \{s + l)\<p{y))l Pa {y) dy. 

Applying (2.4), we see that uq is the image of \xg by the mapping x \— > y = 
x~ l and that conversely fie is the image of vq by y i— > x = tp(y). 
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Let denote the cone of positive definite matrices of order d and 
\ q ) denote the space of linear transformations from MP to M. q . For 
x £ Qg-, xc 3 , j = 1, • • • , k, are well defined and it will be convenient to use 
the following standard notation for various block submatrices: 

xsj = x Kj> , x Rj tSj =X{j> = a;<j] , 
(2.8) _ x 

X[j] — xr } , xy]. — X[ J> x <J> 2; < j] , 

where x < j > G M+,xyj. € M c + _ s ,,X[j > G L(]R C!_S J ', M S J '). It is understood here 
that xm = XM. = Xd whereas both x < \ > and xri> vanish. With this nota- 
tion, we have, for example, \x\ = Ylj=i \ x [j}-\- ^ n the proof of our main theo- 
rems, we will need to split the trace (x,y) for x € Qg an d y G Pg following 
a perfect order of the cliques as given in the following lemma. 

Lemma 2.2. Let G be a decomposable graph and let C\, . . . ,C). be a 
perfect order of its cliques. For x G Qg and y G Pg with y = a" 1 and a G Qg, 
we have 

k 

(x,y) = (x,^ 1 ) =X]K x M"°"[i] 1 ) + (( x ii> x <i> ~ a [i> a <l>)i 

(2.9) 

a [i\X x \i> x <i> ~ °[i>°<i>) x <i>)]- 

This is a direct consequence of (2.3) and the following standard splitting 
of the trace for two positive definite matrices u = f" 1 " 12 ) and v = ( VlV12 )-. 

1 \U21U2' \V2lV2' 

(U,V) = (ui,Vl. 2 ) + (U2-1,V 2 ) 

(2.10) 

+ ((^21^1 1 + V 2 1 V 2 l),V 2 {u2lU 1 1 +V 2 1 V 2 l)u X ), 

and its corresponding expression if we write v = <r _1 with a also positive 
definite, 

(u,^" 1 } = (u^cxf 1 ) + (n 2 .i,o-^i) 

(2.11) 

+ ((«2i«i 1 - o-210-i 1 ),a 2 .\(u 2 iu l 1 - a 2 \o- 1 )ux). 

We also recall the following basic results that will be used throughout our 
proofs. 

Lemma 2.3 ([13]). The Jacobian of the change of variables 
(2.12) x eQ G ^y = (xc 1 , scp]., x^x'j^i = 2,...,k) 

is 
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The following lemma gives a Gaussian distribution we shall use later. 
Lemma 2.4. For x and a in Qq, and for L = L(R Ci ~ Sl , R Si ) we have 

(2.14) JL 

Jci—Si)si/2 _ 



7T 



The proof follows immediately from Theorem 3.1.1 in [14] by replacing 
C, D, Y and M in that theorem by au\.,x~^ i> ,xu > x~^ i> and auyU^y, respec- 
tively. 

Let us finally recall the definition of the multivariate Gamma function. 
For p > , the r- multivariate Gamma function is 

(2.15) r r (p) = vr^M-D f[ r(p - \{j - 1)). 

In the sequel we will need the following two formulas which link multivariate 
gamma functions of different dimensions. For c and s two positive integers 

:-l 
2 

s\ r c (a) 



with s < c and for a > a real number, we have 



(2.16) 7r (c-s)s/2 r c _ s ( a 



2) r,(a)' 

fo 17) (c- S ) S /2 r / n _ r c( a ) 

[ U) ic - s W-r s (a-(c- S )/2)- 

2.2. Tools for homogeneous graphs. In this subsection, we study some 
properties of homogeneous graphs. 

Definition 2.1. A graph G is said to be homogeneous if it is decompos- 

12 3 4 

able and does not contain the graph • — • — • — called Aj, as an induced 
subgraph. 

We will see in Theorem 2.2 below why such a graph is called homogeneous. 
We now need to introduce a number of concepts about undirected graphs. 

Definition 2.2. Given an undirected graph 67= (V,E), the associated 
digraph is the directed graph G' = (V, E') with E' derived from £ by the 
following process. If i,j € V, then the directed edge is in E' if and only 
if 

(2.18) {i}Unb(i)^{j}Unb(j), 
where nb(i) = {j; j ^ i,i ~ j}. 
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Note that E' contains all (i, i) for i G V. We write i — > j if and only if 
€ E' . An edge in G can either disappear in G' or become directed or 
become bi-directed. Note that if i ~ j in G then i -/* j if and only if there 
exists k,k^i,k / j, such that the subgraph of G induced by {i,j, k} is 

• — • — Note also that if k 7^ i, then it is impossible to have both 

i 3 k 

and k — > j. In other words, the configuration in G is forbidden. 

Here are two simple examples of digraphs associated to given graphs. For 
the sake of clarity, the loops i — > i are not drawn on the digraph G'. 

12 12 
Examples. The graph • — • becomes the graph G •<->•. The graph A4 

. 12 3 4 

becomes • •— 

It is easy to see from (2.18) that if G is an undirected graph and G' 
its associated digraph, then the relation % — ► j defined on V is a preorder 
relation, that is i — > j and j — > implies i — > fc. Denote by the induced 
equivalence relation defined on V by 

iRj<^i^j and j — ► i 4^ {i} U n6(i) = {j} U nb(j). 

Denote by i the equivalence class in V/R containing i EV and denote by 
i ^ J the partial order relation on V/i2 induced by the preorder i — > j. As 
usual, when dealing with partial order, the notation i -< j means i ^ j and 
i / j. We now introduce the Hasse diagram of V/R. 

Definition 2.3. The Hasse diagram of G is the digraph with vertex set 
Vh = V/R and with edge set Eh such that an edge is in Eh if 

and 

i^k< j implies either k = i or k = j. 

If G Eh, we write i — ► j. The knowledge of the Hasse diagram of G 
is equivalent to the knowledge of the partial order relation on V/R. If i — ► j 
then j is a child of i and i is a parent of j. If z and j are in V it will be 
convenient to write when the corresponding equivalence classes satisfy 
Let us give an example of construction of a Hasse diagram. 

In Figure 1, we give a graph G, its associated digraph G' and the cor- 
responding Hasse diagram. In G', the loops (i, i) are omitted. Since 3 — > 7 
and 7 — ► 3, {3, 7} is an equivalence class denoted by 3 while the 5 other ver- 
tices are alone in their equivalence class, which we denote by i rather than 
i for simplicity. On this particular example, the Hasse diagram is a rooted 
tree associated to a partial order such that the root 1 is the minimum. The 
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four cliques correspond to the endpoints of the tree: C§ = {1,2,3,7}, C4 = 
{1,2,4} ,C 5 = {1,2,5}, C 6 = {1,6}. The two separators Si = {1,2} with 
multiplicity 2 and S2 = {1} with multiplicity 1 correspond to the other ver- 
tices of the diagram. Note that the graph G? is homogeneous since it does 
not contain any as an induced subgraph. The fact that in this example 
the Hasse diagram is a rooted tree and the graph is homogeneous is not a 
coincidence since we have the following characterization theorem. 

Theorem 2.2. Let G = (V, E) be a connected graph and let G' = (V, E') 
be its associated digraph. The following properties are equivalent: 

1. G is homogeneous. 

2. If ' i ~ j then either i — > j or j — > i in G' . 

3. The Hasse diagram of the partially ordered set (V/R, ■<) is a rooted 
tree such that its root 1 is the minimal point of V/R and such that the 
number of children of a vertex is never equal to one. 

4. Pq is a homogeneous cone (i.e., its automorphism group acts on it 
transitively) . 

5. Qg is a homogeneous cone. 

We shall only use equivalences between 1, 2 and 3, which are easy to 
prove. The equivalence with 4 and 5 is stated for the curiosity of the reader. 
The homogeneous graphs are specially simple to handle. We call T the set 
of vertices of the corresponding Hasse tree, so T = V/R. Consider the subset 
of V 



V 



We gather the properties of the Hasse tree of G in the following proposition. 



Proposition 2.2. IfT is the Hasse rooted tree of a homogeneous graph 
G with k cliques and k 1 minimal separators, we have that: 




•■r 



»3 -* »2 



•4 



•5 



•6 



• 1 



•3 * »2 »5 



Fig. 1. G, G' and the Hasse diagram. 
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1. The mapping i \— > Vj, where i € T, gives a one to one correspondence 
between the cliques and minimal separators of G and, respectively, the end- 
points and non-endpoints ofT. In particular, ifk>l the root 1 is a minimal 
separator which is contained in all minimal separators and cliques of G and 
the total number of vertices in T is equal to k + k' . 

2. All orders of the cliques are perfect. The multiplicity u(V s ) of a sepa- 
rator V s is equal to the number of children of s minus one. 

Proof. 1. If i e T then we observe that Vj is complete since if j and I 
are in Vj then either j <l -< i or I -< j ^ i. In both cases j ~ /. Conversely, 
if C C V is complete then C = \J{j G T; j £ C} is contained in some Vj. 
If not there exist j and I in C which are not comparable in the poset T 
and therefore j ^ I, which contradicts the fact that C is complete. Thus 
the maximal cliques are the VjS where i € T has no children, that is, i is an 
endpoint. Finally, if i G T has children j and I then Vj is a minimal separator 
of j and I as can easily be seen. Conversely, if j and I are in V with j ^ I 
there exists a unique minimal separator between them which is Vj where 
i = max{s £ T; s^j, s^.1}. 

2. Consider any order (t(l), . . . ,t(k)) of the endpoints of the tree and the 
corresponding order (V t ^, . . . , V t ^) of the cliques. For j = 2, . . . , k and for 
l = l,...,j-l 

s(l) = max{s £T;s-< t(j),s -< t(l)}. 

Since 1 ^ s(l) -< t(j) for all I = 1, . . . , j — 1, then s(lj) = max{s(Z); 1 = 1,..., j — 
1} exists and 

v s{lj) = (v t(1) u---uv t{j _ 1) )nv t{j) 

is a minimal separator contained in the clique V t ^.) with lj < j. Thus the 
order is perfect. 

Now, given a minimal separator V s , we show that the number v{V s ) of j 
such that there exists lj with 1 <lj < j <k and s = s(L), where (lj,s(lj)) is 
as defined above, is equal to c(s) — 1 where c = c(s) is the number of children 
of s. Suppose first that u(V s ) > c. Then there exist endpoints t(ji), . . . ,t(j c ) 
of T such that ji < ■ • ■ < j c and such that s = = • • • = s(lj c ). Thus 

s -< . . . , s -< t(j c ). Furthermore lj 1 < j\ and s -< t(lj x ). This implies that 
s has at least c + 1 children, a contradiction. Thus z'(Vs) ^ c ( s ) — 1- Finally, 
one sees by induction that the number of edges of an undirected tree is 
the number of vertices minus one. Since J2 S c(s) is equal to the number of 
edges in the graph, this implies that J2 S ( C ( S ) ~ ^) = k — 1, where the sum 
is taken over the non-endpoints s of T. To conclude the proof, we use the 
fact that by definition of the multiplicity of a minimal separator, the sum 
of the u(Vs) is also k — 1. Thus £ s [c(s) - 1 - u(V s )] = (k - 1) - (k - 1) = 0. 
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Since we have a null sum of nonnegative terms we get v(V s ) = c(s) — 1 for 
all minimal separators. □ 

It follows from the proposition above that there is a one to one corre- 
spondence between the set of homogeneous graphs and the set of rooted 
trees with vertices weighted by positive integers and such that no vertex 
has exactly one child. Note that a complete graph is homogeneous. It is 
characterized by the fact that its Hasse diagram is just a point. A decom- 
posable graph with only one separator is homogeneous. Its Hasse tree looks 
like a daisy. An undirected tree is decomposable but is not homogeneous in 
general. Finally it is possible to prove that if all orders of the cliques of a 
decomposable graph G are perfect then G is homogeneous. 

3. The Wishart families of Types I and II. In this section, we define 
two families of Wishart distributions. We will study special cases in Section 
3.2, the homogeneous case in Section 3.3 and the nonhomogeneous case in 
Section 3.4. 

3.1. Definitions. Consider the two integrals 

(3.1) I(a,(3;y)=f H G (a, (3;x)fi G (dx) for y E Pq, 

JQg 

(3.2) J(a,(3;x)=f e'^ H G (a, /3;<p(y))v G (dy) for x E Qg- 

Jp G 

We define A to be the set of (a,/3) such that I(a,(3;y) converges for all 
y £ P G and such that y i— > jj^a^-^(y)) * s a cons t an t on Pq. This constant 
is a function on A that we denote by T\(a,f3). Similarly we define B to be 
the set of (a,/?) such that J(a,(3;x) converges for all x 6 Qq and such that 

x i— ► d^'^'aK is a constant on Qq. This constant is a function on B that we 

H a [a,j3;x) ^ u 

denote by Fu(a,{3). The sets A and B will be studied in Sections 3.3, 3.4 
and 3.5. 

We note here that since hq(cIu) is the image of the measure VQ(dy) under 
the mapping yt-¥ u = tp(y) (see Section 2.1), (3.2) can be written 

J(a,[3;x)= e~ ( ' x,u ~ 1 ' > HG(a,P;u)fiG(du) for x € Qq. 
JQg 

This expression of (3.2) and the passage from y G Pq to u = ip(y) € Qg 
will be used several times in the remainder of the paper for defining the 
inverse Type II Wishart and to perform various computations. The Wishart 
distributions of Type I will be the probabilities 

w R , „ e-^H G (a,0;x)» G (dx), 

Ti{a,f3)HG{a,P;<p(y)) 
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defined on Q G and indexed by the parameters (a,(3;y) in A x P G . To follow 
the standard notation for distributions related to the Wishart, when y G P G 
is the parameter of the Type I Wishart, we often write y = a" 1 with a G 
Qc so that, for a G Q G , (a,/?) G .A, the Type I Wishart distribution can be 
written 

(3.3) W Qg (a, 0, a; dx) = ^ G (dx). 
The Wishart distributions of Type II will be the probabilities 

(3.4) W PG (a,P,0;dy)=e-^ f^f^ ^ G {dy) 

defined on Pq and indexed by the parameters (a, 0; 8) in 23 x Q G . We there- 
fore consider the following two natural exponential families. 

Definition 3.1. For (a, (3) G A, the Type I Wishart family of distribu- 
tions is defined by 

(3.5) = {W QG (a,f3,a;dx),a G Q G }. 

Definition 3.2. For (a,0) G B, the Type II Wishart family of distri- 
butions is defined by 

(3.6) ?{a,P),n = {Wp g (a, 0, 6; dy),6e Q G }. 

Following the pattern of what is done for the Wishart distribution, we 
now define Type I and Type II inverse Wishart and F distributions. 

Definition 3.3. Let G be given. If X ~ WQ G (a,/3,a) where (a, 0) G A 
and a G Q G , then Y = X~ 1 is said to follow the inverse Type I Wishart, 
defined on Pq, and its distribution is 

(3.7) IW QG (a, 0, a; dy) = u G (dy) . 

The distribution (3.7) is clearly immediately derived from the distribution 
(3.3) by recalling that x = <p(y) and that v G (dy) is the image of ^ G (dx) by 
the mapping x h- > y = x" 1 . 

Definition 3.4. Let G be given. If Y ~ Wp a (a,0,6) where (a,/3) G B 
and G Qg 5 then X = <p(Y) is said to follow the inverse Type II Wishart, 
defined on Q G , and its distribution is 

e -{e,&- x )H G ( a R-A 
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Here too, the density (3.8) is immediately derived from (3.4). 

Let B - A = {{a' - a, - p) : (a', 0) € B, (a, P) € A}. Since B - A C B and 
A — B C A are false in general, as will be seen, for example, when G = A±, 
to give the following definition of the F distributions, we will have to insure 
that the parameters a' — a and (3' — (3 are in the correct sets. 

Definition 3.5. Let and a be in P G and Qg, and let (a, (3) € A and 
(a',/3') EB. Then 

1. for (a' — a, (3' — (3) € B,a 6 Qg, the F distribution of the first kind 
with parameters (a, (3, a' , (3' ,a) is the distribution on Qg 

Tiija' - a, f3' - (3) . f , . 
— — H G (-a ,-P;<t) 

Ti(a,f3)T u (a',f3') 

x # G (a' - q,/3' - ^;CT + £)# G (a,^;x)/i G (cte); 

2. for (a — a',/3 — /?') £ A,0 £ Qg, the F distribution of the second kind 
with parameters (a, f3,a' , (3' ,6) is the distribution on Pq 

Ti(a - a' , - 0') 

— — — H G (-a, -P; (p(0)) 

Ti(a,P)T n {a\P') 

x H G (a -a',p- 0- <p{9 + y))H G (a', P'; y{y))v G {dy). 

Note here again that the lack of multiplicative structure on Pq and Qg 
when G is not complete prevents us from relating these distributions to some 
form of quotient X/X' of independent random variables with distributions 
Wq g (a, P; a) and Wq g (a', P';a), respectively. A study of the multivariate F 
distribution when G is complete can be found in [15]. We could also define 
rather explicitly Beta distributions of Type I by introducing the conditional 
distributions X\X + X', where X ~ W QG (a,P,a) and X' ~ W QG (a',P',a) 
are independent such that (a + a',/3 + /?') is still in A. Again, since the 
cone Qg has no special multiplicative structure, these Beta distributions 
unfortunately do not seem to enjoy properties linking them to some ratio 
analogous to Xj {X + X') as happens when the graph is complete. The same 
problem arises with Beta distributions of Type II. Finally, we could also 
consider the distribution 

e^H G (a,(3;x) 

FT — ^T~\ (dx) 

where we only require that (a, p) be such that I(a,P;y) defined by (3.1) 
converges. Under such generality, these distributions have no interesting 
properties: their Laplace transforms are not explicit, their family is not stable 
by convolution as our Wishart distributions of Type I are (see Proposition 
3.2) and they have no hyper Markov property. A similar remark holds for J 
defined by (3.2) and Type II Wisharts. 
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3.2. The hyper and inverse hyper inverse Wishart distributions. We first 
observe that when G is complete, both Type I and Type II Wishart distribu- 
tions coincide with the ordinary Wishart distribution. We will see now that 
for special values of (a,/3), the Type I and II Wisharts are, respectively, the 
hyper Wishart as defined by Dawid and Lauritzen [7] and the G- Wishart 
first identified by Roverato [16] as the inverse of the hyper inverse Wishart 
defined also by Dawid and Lauritzen [7]. To describe these distributions, it 
is convenient to fix a perfect order of the cliques. 

The hyper Wishart on Qg- Let G be given and let p be a scalar. Let A\ 
be the one-dimensional subset of M. k+k defined as 

Ai = |(a,/3)[a(C) =p,C eC,f3(S) =p,S e S with p> max ±(|<7| - 1)J. 
For (a, ff) £ A\ we then have 

(3.9) W QG (a,P,a;dx)oc—T -l Qa {x)dx 

with 

\ Tn |p-(c«+l)/2 _! 
i \ I^Ci -\ x Ci i&n / 

Wc ^a c , XCi )= e o t . 

We note that the expression of WQ G (a, (3,a;dx) in (3.9) does not depend 
on the chosen perfect order of the cliques. The expression on the right-hand 
side of (3.9) is a Markov combination of Wishart distributions with shape 
parameters p and scale parameters ac t and as t on the cliques and separators 
of G, respectively. By Theorem 2.6 of [7], it is a distribution. It is in fact 
the hyper Wishart distribution, as defined in that same paper. Therefore 
both sides of (3.9) are equal and equal to the density of the hyper Wishart 
distribution and it follows immediately that A\ C A for any given G and 

r,(a,g) = n r' raW . 
n5Ur„(j>) 

The G- Wishart on Pq- Let G be given and let 5 > be a scalar. 
Let B\ be the one-dimensional subset of defined as 

Bi = {(a,/3)\a(C) = -\{5 + \C\ - 1), C e C, 

p(S) = -|(<5 + \S\ -l),SeS,5> 0}. 

For (a, (3) € B\ and for x = ip(y) 

TT fc Wr< |-(5+Ci-l)/2+(ci+l)/2 

H G (a,PMy)>o(dy) = |^|_ ( ^_ 1)/2+(s , +1)/2 ^(^)^ 
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where, as before, the expression of Hc{a, f3; y>{y))vG{dy) does not depend 
on any chosen perfect order of the cliques. Therefore, Wp G (a, f3,0;dy) oc 
|y|( (5 -2)/2 e -(9,?/) j g 67- Wishart distribution first identified by Roverato 
[16] as the inverse of the hyper inverse Wishart. It follows immediately that 
BxCB, that 

r n (a,^ n r' r ^ + c -- 1)/2) 
nt 2 r„((i + »i-i)/2) 

and that the Type II Wishart is the G- Wishart defined on Pq for S > 0, 
9 £ Qg- The distribution of this special Type II Wishart is 

nil \o Cl \ {s+c ^ l)/2 m= 2 r Si ((s + Si - 1)/2) 



W Pa (a,P,0) 



nti rc((* + ci - 1)/2) ni 2 i^i^- 1 )/ 2 

x | y |(^)/2 e -(M lpG ( y)d ,. 



3.3. T/ie homogeneous case. We now consider the Type I and II Wishart 
distributions when the graph G is homogeneous as defined in Section 2.2. 
Our aim is to identify the sets A and B and the values of the normalizing 
constants T\{a,(3) and Tu(a,(3). It is convenient to introduce the following 
notation, consistent with the notation introduced in (2.8). For u GV/R, we 
define 

x [u] = { x ij)t=j=ui x [u> = ( x ijH=u,]-<ui 

x <u> ( x ij)i^u,j^ui x [u\- x [u] ~ x [u> x <u> x <u] ■ 

We also adopt the convention that a vertex of the Hasse tree (T, Eh) will be 
denoted by t if it is an endpoint of the tree and by q if it is not an endpoint. 
From Proposition 2.2, to each t corresponds a unique clique Ct = V) U &T u<A u \ 
and therefore a number at = a(Ct). And, to each q corresponds a unique 
minimal separator S q = Uug7>^<jM an d therefore a number j3 q = @(S q ). 
The positive integer v{q) is the number of children of q minus one. From 
Proposition 2.2, this is also the multiplicity of S q . With these conventions, 
for each u G T, we write 

Pu = Pu(a, (3) = a t - v (<l)Pq- 



We define n u to be the cardinality of the vertex u of the Hasse tree, that is, 
the number of vertices in V that are in the vertex u of the Hasse tree of G. 
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We also define 

(3.10) m u = n u + ^n v = ^n v . 



The following two theorems give A,B and the corresponding normalizing 
constants for the Wisharts of Type I and II in the homogeneous case. 

Theorem 3.1. Let G be a homogeneous graph. Then 
A=l(a,P)\p u >l^n v -l\,ueT^. 

More specifically for (a, (3) € A and a € Q G , the integral (3.1) converges and 
e-^^H G (a,(3;x)fi G (dx) 



Qg 



(3.11) 



uGT V v-<u ) 

= H G (a, fj; a) J] ^ n ^ 2 T nu L - £ y Y 

Theorem 3.2. Let G be a homogeneous graph. Then 
B=Ua,P)\-p u >l(^n v -lJ,ueT^. 

More specifically for (a,/?) € B and 6 € Qg, the integral (3.2) converges and 

[ e-^H G (a,(3^(y))u G (dy) 
Jp G 

(3.i2) =u^^ nunv/ %u ] r Pu rnJ-Pu-Y, n ^ 2 ] 

= H G (a, ft 6) ]J ^ n ^ /2 T nu {-p u - £ y Y 

We note that the parameter sets A and B in the homogeneous case are 
(k + /c')-dimensional. The proof of Theorem 3.1 follows the same line as that 
of the proof of Theorem 3.3 given in the Appendix. It is based on Proposition 
3.1 given below and on the following analog of formula (2.9) for the traces: 
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for x and a in Qq, 

(3-13) + ((x [u> x~ l u> - o-[ u )0-<i>), 

a [ll]-( X [u> X <U> ~ a [u>°~<\>) X <U>)], 

where it is understood that, as in (2.8) and (2.9), for u = 1, the root of T, 
the summand reduces to (xi,^ 1 ). Then, using Proposition 3.1 and formula 
(3.13), the integral in (3.11) is obtained by a series of standard integrations. 
The proof of (3.13) is parallel to the proof of (2.9) and will not be given 
here. The proof of Proposition 3.1 is given in the Appendix. 



Proposition 3.1. ForG homogeneous, the image of Hc(a, (5; x)/j,c(dx) 
under the mapping 

(3.14) x = (x [u] , x [u> , u 6T)h {x [u] .,x [u> x^ u> ,u G T) 

is 

H G (a,/3;x)[i G (dx [u] .,dx [u> x<l > ,u G T) 

= II \ x n\ Xu ~ (nu+1)/2 'dx [u] .d{x [u> x^ u> ), 

where 



The proof of Theorem 3.2 also follows the general lines of the proof 
of Theorem 3.4 given in the Appendix. We first observe that the image 
of Hc(a, f3\ip(y))x uc{dy) under the change of variable y i— > x = (p(y) is 
Hc{a, /?; x)pc{dx) so that 



Pg 

r-V,*- 1 ) Hj=l \ x Cj\ j 

where, as usual, the integral on the right-hand side of the equation above 
does not depend upon the chosen perfect order of the cliques. We then use 
(3.14) and (3.13) applied to (#,x -1 ) to obtain the expression of the integral 
in (3.12) by a series of standard integrations. 
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Using Proposition 3.1, (3.11) and (3.13) it is fairly straightforward to 
show that the image of the Type I Wishart by the change of variable (3.14) 
is the distribution 

Wq g (a, 0, a; dx [u] ., d(x [u> x< l u> ) , ueT) 

= J~J | x |^-K+l)/2 e -(s M .,<T-p 

^3 x e -{( x h> x <«>- CT [«> cr <i>)< cr [u[( a; [u> a: <i>- cr [u>o"<i>)^<u>) 

x (> /2 ^--v M rrv, (x u - \ E^)) _1 

X l-D u (%[u]-i ( x [u> x <u>)) dX[ u ]- ^( x [u> x <u^ 

where D u = (M+ u x L(M n ", M m "~ nu )). This distribution is exactly the distri- 
bution of the Wisharts defined by Andersson and Wojnar [3] on homogeneous 
cones. Therefore when G is a homogeneous graph the Type I Wisharts co- 
incide with the Wisharts of [3] for the homogeneous cone Qg corresponding 
to G. Since the dual Pq of Qg is also homogeneous, we could also show that 
the Type II Wisharts correspond to the Wisharts as defined by Andersson 
and Wojnar [3] on Pq- However, there are many other homogeneous cones 
not of the form Pq and Qg- Our calculations are simpler and self contained 
in the particular cases that we investigate here. 

Using (3.15), Proposition 3.1, (3.13) and (3.12), we obtain the image of 
the inverse Type II Wishart by the change of variable (3.14). The image of 
the distribution of X ~ IWp G {a,f3,6), the inverse of the Type II Wishart 
when G is homogeneous, is given by 

/Wp G (a,A^;^[«].,rfa;[«>a;<i>,wG T) 

| XrfJ- |^-("»+l)/2 e -<*M..»M.> 

(3 x e -(( :r [«> a; <«>- 6 '[ U > 6, <i>): 6 '<">( :r <i> a: <u]- 6, <i> £ '< U ]) a: ^.> 

x ^ (1/2) - \0 [u] .r^ nv {-p u -Ey))' 



n 



X 1 D u { x [u\; ( x [u> x <l>)) dX[ u ]. d(x 

[u> x <u>) 



Example. Consider the graph Go (see Figure 2). We index each clique 
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•4 
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X 



•6 



• I 



Fig. 2. Graph G . 



according to the vertex of the Hasse tree of Go which represents it. Thus 
G 3 = {1,2,3}, G 4 = {1,2,4}, C 5 = {1,2,5}, C 6 = {1,6}. Minimal separators 
are Si = {1} and S 2 = {1, 2} with v(S 2 ) = 2 and v{S\) = 1. We set 

a(C 3 ) = a 3 , a (C i )=a A , a(C 5 ) = a 5 , a(G 6 ) = a 6 , 

/3{S 1 ) = /3 1 , P(S 2 ) = p 2 . 

The Hasse tree corresponding to Go is identical to the Hasse diagram of 
Figure 1 with 3 replaced by 3. Since the cardinality of all the vertices of the 
Hasse tree is 1, for the sake of simplicity we will denote the vertices of the 
tree by 1, 2, 3, 4, 5 and 6 so that we have 

m = l, i = l,...,6 and 1^6,1^2,2^3,2^4,2^5, 

Pi = «3 + "4 + «5 + "6 — Pi — 2/02) P2 = "3 + "4 + «5 — 2/?2, 

P3 = a3, /0 4 = a 4 , p 5 = a 5 , p 6 = a e , 



A={(a,P)\pi >0, p 2 >§, Pi > l,t = 3,4,5, /3 6 >i} 

(3.17) ={(a,/9)|ai>l, i = 3,4,5, a 6 > ±, 

a 3 + a 4 + a 5 + a 6 - 2/3 2 - A > 0, a 3 + a 4 + a 5 - 2/J 2 > 5}, 
B = {(a,/3)|pi < -|,p 2 < -f,Pi <0,i = 3,4,5,6} 

(3.18) ={(a,/9)|ai <0, i = 3,4,5,6, 

«3 + «4 + «5 + "6 - 2/?2 — Pi< — f ) a 3 + a 4 + a 5 - 2/3 2 < — |}. 



3.4. The nonhomogeneous case. We now consider a nonhomogeneous 
graph G, that is, a graph containing A4 as an induced subgraph. As in 



and 




Therefore 
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the case of homogeneous graphs, our aim is to identify A,B and the corre- 
sponding eigenvalues. We will see that we are, in fact, only able to identify 
a subset of A and B and the corresponding eigenvalues T(a,(3). The results 
are given in Theorem 3.3 and Theorem 3.4 below. For G a noncomplete 
decomposable graph, let P = (Ci, . . . , C&) be a perfect order of the family C 
of its cliques and (£2, • • • , Sk) be the associated sequence of minimal separa- 
tors. Recall that cy = \Cj\ and Sj = \Sj\ denote the cardinality of Cj and Sj, 
respectively. For given a and (3 we write ay = a(Cy) and f3j = f3(Sj). For a 
given minimal separator S we write 

J(P,S) = {j = 2,...,k\S j = S}, 

and for a given perfect order P of the cliques, we define Ap to be the set of 
(a, /3) such that: 

1. J2j£j(p,s) a j ~ = 0) f° r an $ different of S2; 

2. ay - 2^1 > for all Cj e C; 

3. ai+5 2 > where 5 2 = J2jeJ(p,s 2 ) a j ~ v i S 2)P2- 

Recall also that T n (p) is defined in (2.15). To avoid trivialities, in the 
following statements we assume that G is not complete in order to have 
at least one minimal separator. Theorems 3.3 and 3.4 are useful only for 
nonhomogeneous graphs, since stronger results, Theorems 3.1 and 3.2, are 
available for homogeneous graphs. 

Theorem 3.3. Let G be a noncomplete decomposable graph and let P 
be a perfect order of its cliques. Then Ap C A. More specifically for y € Pq 
and for (a,/3) 6 Ap the integral (3.1) converges and 

(3.19) f e-^H G (a, (3; x)^ G {dx) = T^a, 0)H G (a, (3; <p(y)), 
JQa 

where 

(3.20) ri (a,/3) = T S2 (a 1+ 5 2 )^p\ f[ 

r«(ai) *J- 2 r Sq {a q ) 

Equivalently, if we write y = a^ 1 with a G Q G , (3.19) can be rewritten as 

(3.21) / e-( x ' a ^H G (a,{3-x) t i G (dx) = r 1 (a,[3)H G (a,{3-o-). 
JQa 

To study B for a nonhomogeneous graph and give the normalizing constant 
of the Type II Wishart, we now need to define, for a given P, the set Bp to 
be the set of (a,/?) such that 

1. J2j£j(p,s)( a j + ~ s j)) ~ V {S)P{S) = 0, for all S different from S2; 
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-a ri 



\{c q — s q — 1) > for all q = 2, . . . , k and —a\ — \{c\ — s 2 — 1) > 0; 



3. -ai-|(ci-s 2 + l)-72 > ^2^- where 72 = EjeJ(P,s 2 )( a i -Pz + ^r 1 )- 

Theorem 3.4. Let G be a noncomplete decomposable graph and let P 
be a perfect order of its cliques. Then Bp C B. More specifically for € Q G 
and {a, (5) € Bp the integral (3.2) converges and 

(3.22) f e^H G (a,p;ip(y))u G (dy) = T n (a,(3)H G (a,P;e), 
Jp G 

where 

r / fl \ r r ci-s 2 1 r ci (-ai] 

Fii(a,P) = T s 

(3.23) 



c\ - s 2 
-«i 5 72 



r s2 (-ai - (ci - s 2 )/2) 



x 



j=2 r ^(~ a i ~ ( c i " s j)/2)' 

It is interesting to reexpress (3.22) in a slightly different way. Writing y = 
x^ 1 with x G Qg an d recalling that the image of v G {dy) under y 1— > ip(y) = x 
is fj,Q(dx), we see that (3.22) can be rewritten as 

(3.24) / e^ e -^H G (a,(i-x)^ G {dx) = T ll (a,[j)H G (a,(3-e). 
JQg 

From the two theorems above, it follows immediately that 

Ad\JA p and Bd\JB p , 

p p 

where the union of all Ap and all Bp is taken over all possible perfect orders 
of the cliques of G. Before making some important remarks, let us give an 
example. 

1234 

Example. Consider the graph G = A4 : • — • — • — 

Let Ci = {1,2}, C 2 = {2,3}, C 3 = {3,4}, S 2 = {2}, S 3 = {3}, and let 
a{Ci) = a u i = l,2,3, p(Si) = (3 h i = 2,3. Then P x = (C x , C 2 , C 3 ) and P 2 = 
(C2,Ci,C3) are perfect orders of the cliques. The orders P{ = (C 3 ,C2,Ci) 
and P2 = (C2)C3)Ci) are also perfect orders analogous respectively to P\ 
and P 2 - On the other hand, the only other possible order (Ci, 63,(72) and 
its analog (C 3 ,Ci,C 2 ) are not perfect. Let us therefore identify Ap and -Bp 
for P 1 ,P{ and P 2 ,P^: 

A Pl = {(at, a 2 , 013, @2,lh)\<Xi > h i = 1 > 2 ,3, ai + a 2 - /3 2 > 0, a 3 = /3 3 }, 
Ap/ ={(ai,a 2 ,Q!3,^2,/33)|ai > h i = 1,2,3, a 2 + «3 - /% > 0, ai=/3 2 }, 
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while Ap 2 = Ap 1 and Ap^ = Ap> . In a parallel way, we have 

Bp 1 = {(a 1 ,a 2 ,a3,P2,(h)\ - a» > 0, £ = 1,2,3, 

-ai - a 2 + fh - 1 > 0, a 3 + | = /3 3 }, 
£ P / = {(ai,a 2 ,a 3 ,/3 2 ,/3 3 )| - on > 0, £ = 1,2,3, 

-a 2 - «3 + /% - 1 > 0, ai + \ = 2 }, 
while Bp 2 = Bp 1 and Bp^ = Bp^. 

Remarks. 1. The domains Ap and Bp on which (3.19) and (3.22), 
respectively, or equivalently (3.21) and (3.24) hold, depend upon the chosen 
perfect order P of the cliques. Since the functions Hq do not depend upon 
P, it is clear that, even though the expressions of T\ and Tu depend upon 
P, their values do not. 

2. Since assumption 1 of Theorems 3.3 and 3.4 represents k' — 1 con- 
straints on the set of (a,/3)'s, we see that in general each set Ap is of 
dimension k + 1. 

3. Prom Theorems 3.3 and 3.4, the integrals (3.1) and (3.2) are finite and 
constant multiples of Hc(a, f3;a) and H(j(a, (3; 6) for (a, {3) in \J P Ap and 
{JpBp, respectively. Using Holder's inequality it is immediate to prove that 
these integrals are also finite on the convex hull of [JpAp and {JpBp. So 
the question naturally arises as to whether A and B are larger than {JpAp 
and Up -Bp. We only have a partial answer to this. We have seen in the 
previous section that, when G is homogeneous, A and B are completely 
known and of full dimension k + k 1 . However, if we consider the homoge- 
neous example given in Section 3.3 and treat it using the methods given 
in this section, we will find that the 24 possible orders are all perfect with 
Pi = (Ci, C 2 , CsjCi) and P 2 = (Ci, C4, C 2 , C3) being the only perfect orders 
yielding distinct ^4p's. We have 

A Pl = {(a,/3)\ai > 1, £ = 1,2,3, a 4 > \, a± + a 2 + a 3 - 2/3 2 > 5, a 4 = /3i}, 
Bp L ={(a,P)\a i <0, £ = 1,2,3,4, 

-a-i - a 2 - «3 + 2/32 > §, «4 - A + \ = 0}, 
Ap 2 ={(a,(3)\ai >1, £ = 1,2,3, 

04 > i, ai + «4 - /3i > 0, a 2 + 03 — 2/5 2 = 0}, 
Bp 2 ={(a,(3)\ai < 0, £ = 1,2,3,4, 

— ai - a 4 + /3i > 2, a 2 + a 3 - 2/3 2 + 1 = 0}. 



a ■ 
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Clearly, Ap 1 U Ap 2 is included in, but not equal to, A as given in (3.17). 
Similarly, Bp ± U Bp 2 is included in, but not equal to, B as given in (3.18). 
The question is therefore whether in the nonhomogeneous case it is possible 
to identify A and B. In the next section, we find A and B for G = A4 and 
we see that they are of dimension strictly less than k + k' . Thus G = A^ is 
a counterexample to the hypothesis that in the nonhomogeneous case we 
could also define a set of dimension k + k' on which (3.21) and (3.22) hold. 

3.5. The case G = A4. 

Let G be A^ as in the previous example. Then we write 

/ 01 0"12 \ 
£7 21 CT 2 023 

0"32 <73 034 

V 043 04 / 

for a G Qg-, with oij = aji, Ui.j = 0j — <Tij<rJ l aji and similarly for 6 G Qq. 

Proposition 3.2. Consider the graph G = A^ with cliques and separa- 
tors 

C 1 = {1,2}, C 2 = {2,3}, C 3 = {3,4}, S 2 = {2}, S 3 = {3}. 

Letai = a(Ci), i = 1,2,3, @i = f3(Si), i = 2,3. Define 

Aa = {(a,/3)\cti > |, i = 1,2,3, ai+a 2 >/J 2 , a2 + «3>/33}- 

T/ien t/ie following integral converges for all a G Qyi 4 i/ and only if (a, /3) is 
in Aa- Under these conditions, it is equal to 

e-^ a ^H G (a,p;x)fi G (dx) 

= tt 3 / 2 T fai - I) r (a 2 - ^ r (a 3 - ^ T(ai + a 2 - P. 

(3.25) 



G 



27 V 27 V 2, 
xr(a 2 + a 3 -/? 3 )(r(a 2 )) 0i. 2 2 . 3 03-2 °4 3 



x 2 Fi ai+a 2 -/3 2 ,a 2 + a 3 -/3 3 ,a 2 , 



0203, 

where 2 Fi denotes the hypergeometric function. Similarly we define 

B4 = {(a, 0) \a\ < 0, a 3 < 0, /3 2 — ct\ — a 2 — ^ > 0, 

(3 3 - a 2 - a 3 - 1 > 0, P 2 + 03 - "l - OL 2 - a 3 - § > 0}. 

T/ien i/ie following integral converges for all for 9 G Qg if and only if (a, j3) G 
#4. Under these conditions, it is equal to 
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e-^H G (a,/3;<p(y))v G (dy) 

Pa 

1 ■ 2 2 3 4*3 

x r(-ai)r^/3 2 -ol\ -a 2 - ^r^/3 2 + /?3 - - «2 - a 3 - ^ 

(3.26) xT[l3 3 - a 2 - a 3 - ^r(-a 3 )(r(/3 2 + 3 - a x - a 2 - a 3 - l))^ 1 
/ll 

/?2 + #3 - "1 - a 2 - «3 - 1; 7T7T 



7 2C3 



The results above are obtained by a nontrivial and long computation. A 
central part of this computation is the following lemma. 



Lemma 3.1. Consider the following 2x2 random matrix X 
with the Wishart distribution 
(det c) 



X\ X12 
X\2 X2 



w 2 (p,c 1 ]dx)= e ( - x ' c \x 1 x 2 - x\ 2 ) p 3/2 l M +(x)dx 1 dx 2 dxi 2 



with p > 1/2 and c ■ 



Cl C12 

C12C2 

Mellin transform of (Xi,X 2 ) is 



positive definite. For a\ > —p and a 2 > —p, the 



v (Y a im (detc)P T{a l +p)T{a 2 +p) ( c\ 2 



The proofs of Proposition 3.2 and Lemma 3.1 are omitted. 

We now derive from Proposition 3.2 the sets A and B when G = A4. 

Corollary 3.1. LetG = A 4: . Then A = \JAp and B = \JBp, where the 
unions are taken over the two possible Ap and Bp. The dimension of A and 
B is therefore strictly less than k + k' . 

Proof. Since the two statements are quite similar, we prove the second 
one only. We use the equality (see [1], Formula 15.3.3) 

(3.27) (l-z) a+b ~ c 2 Fi(o, b;c;z)= 2 Fi(c- a,c-b;c;z). 

Using (3.27) on the right hand side of (3.26) above with z = and 
(a,b,c) = (f3 2 - ai - a 2 - ±,/3 3 - a 2 - a 3 - \;(3 2 + /3 3 - ai - a 2 - a 3 - 1), 
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we see that 

n I 1 a m / e-^H G {a,f3-My))vG{dy) = C2F 1 {c-a,c-b;c-z), 
H G {a,/J;V) Jq g 

where C is a constant which does not depend on z. Now clearly from its 
Taylor expansion the hypergeometric function z i — ^2-^1 ( c — a,c — b;c; z) is a 
constant if and only if either c — a or c — b is zero, which together with #4 
proves that constancy occurs if and only if (a,/3) belongs to one of the two 
possible Bp as given in the example of Section 3.4 above. □ 

Remark. One can prove that the convex hulls in R 5 of A and B are, 
respectively, strictly included in A4 and #4 as defined in Proposition 3.2. 



4. Properties of the Type I and II Wisharts. Let us recall that for a 
given decomposable graph G, the r-dimensional graphical Gaussian model 
Markov with respect to G is the family of distributions 

A/" G = {iV r (0,£),£eQ G }. 

Dawid and Lauritzen [7], page 1306, have shown that this model is strong 
meta Markov. This can also be shown directly since, using the notation 
of (2.8), for a given perfect order of the cliques and with the convention 
that sc m. = xci,Sm. = T l c 1 -,x < \ > = 0,ri = c±, the density of X £ W with 
distribution A^O,!!) E Mg can be written as 

k 

(4 1) fix) = 17 1 c (-V2)((x w -x [i> S- i 1 > S <ll ),S- ] 1 (x m -x [i> S- i 1 > £<,])) 

The parameters 

(4.2) F 1 = S Cl (L; = £[;>£-*>, AT* = E M .), i = 2,...,fc, 

of the distributions of Xc^XmlrcH^, i = 2, . . . , A;, respectively, are clearly 
variation independent in the sense that any parameter (Sr^S^^, Em.) of 
the distribution of Xu \Xfj i _ 1 is compatible with any parameter {F±, (Lj,Nj),j < 
i} of Xn i _ 1 and any parameter {Lj,Nj,j > 1} of X\Xui- It then follows that 
for a decomposition (A, B) of G the parameter of the distribution of Xa 
is variation independent of the parameter Sgu of the conditional distribu- 
tion of Xb given Xa- This, according to Definition 4.3 of [7], means that 
the model Ng is strong meta-Markov. For this model, Dawid and Lauritzen 
([7], page 1306 and page 1308) have shown that the distribution of the max- 
imum likelihood estimator of S, that is the hyper Wishart, is weak hyper 
Markov and that the hyper inverse Wishart, the inverse of the G- Wishart, 
is a conjugate prior on S which is strong hyper Markov. 
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We are now going to show parallel results for the Type I and II Wisharts: 
the Type I Wishart is weak hyper Markov, the inverse of the Type II Wishart 
forms a conjugate family for the scale parameter of the Mg model and for 
any direction given to the graph by a perfect order of its cliques, and the 
inverse Type II Wishart is strong directed hyper Markov. Since we have 
seen in Section 3.2 that the hyper Wishart is a particular case of the type 
I Wishart and the hyper inverse Wishart is a particular case of the inverse 
Type II, it is not surprising that their generalizations hold parallel Markov 
properties. 

One might wonder whether the term "hyper" is adequate when talking 
about the weak Markov property of the Type I Wishart since this distribu- 
tion has so far neither been identified as the distribution of an estimator nor 
as a prior distribution for the parameter of a Gaussian model. It is certainly 
adequate for the inverse of the Type II Wishart since, as we are going to 
prove right away in Section 4.1, it forms a conjugate family of prior distribu- 
tions for the scale parameter of the Mg model, with a shape parameter set 
of dimension at least k + 1 . We will then prove the hyper Markov properties 
in Section 4.2. Our main results below are Corollary 4.1 and Theorem 4.4. 

4.1. Conjugate prior distributions. The family of inverse Type II Wishart 
distributions has several properties that make it useful as a rich family of 
conjugate prior distributions for the scale parameter Sg- of the graphical 
Gaussian model Markov with respect to a decomposable graph G. Recall 
that, following the notation used in the Introduction, if £ is the positive 
definite covariance matrix for iV r (0,£) € Mg, then Eg = 7r(S) £ Qg is the 
scale parameter for the N r (0, S) distribution. We have the following general 
result. 

Theorem 4.1. Let G be a decomposable graph and let P be a perfect or- 
der of its cliques. Let D be in Qg, let (a, (3) be in Ap and (a' , (3') be in Bp. If 
the joint distribution of (X, S^) on Qg x Qg is WQ G (a, 0, a; dx)IWp G (a 1 , /?', 
D; da), then the conditional distribution ofT,Q knowing X = x is IWp G (a' — 
a,/3' — (3, D + x;da) and the marginal distribution of X is an F distribution 
of the first kind with parameter (a, j3, a' , /?', D). 

PROOF. The joint distribution of (X, Eg) is 
W Qg (a, 0, a; dx)IWp G (a', (3', D; da) 

= e-^-^H G (a,f3;x) e~( D ^ H G (a' , f3';a) 

ri(a, /3)H G (a, 13; a) T n (a>, f3>)H G (a>, (3'; D) f " G[aa) 

' H G {a,l3;x) 

r : (a, P)T u {ct, (3>)H G (a',P>; D) m X] 

x e- {x+D '°~ l) H G {a' -a, (3'- (3; a) na(da) , 
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from which the result follows immediately. □ 

Theorem 4.1 shows that the family of IWp G distributions is a conjugate 
family for the scale parameter a of the Wq g (a, (3, o~\ dx) . Consider now a 
sample Z\, . . . , Z n from a Gaussian distribution Markov with respect to G, 
and let S = £EiU z i z \- Th en vr(S), the maximum likelihood estimator of 
Yjq is such that nir(S) is hyper Wishart with shape parameter p = § and 
scale parameter Eg, that is, Wishart of Type I with shape parameter 

|C| — 1 

a(C)=p, C^C, f3(S)=p, S^S, p > max — - — 

and scale parameter 2Eg- Applying Theorem 4.1 to X = nir(S), we obtain 
the following corollary. 

Corollary 4.1. Let G be decomposable and let P be a perfect order of 
its cliques. Let (Z±, . . . , Z n ) be a sample from the N r (0, S) distribution with 
£ Qg- If the prior distribution on 2£g is IWp G {a! , /3' , D) with (a',/3') € 
Bp and D € Qq, the posterior distribution of 2T,q, given nS = Ya=i z iZf, 
is IWp G (a' - §,/?' - §,£> + Tv(nS)), where a' - § = («i - f , . . . , a k - §) 
and 15' - § = Q3( - §,..., /3' k - §) are such that (a' - \,0 - f ) € B P and 
D + ir(nS)£Q G . 

This means that the family {IWp G (a, /?, £)), (a, |3) Qg} is a con- 

jugate family for the scale parameter Eg of the Gaussian model Markov 
with respect to G. We note this family has its shape parameter set of di- 
mension at least k + 1 and is therefore much richer than the traditional 
Diaconis-Ylvisaker family with shape parameter set of dimension equal to 
1. 

Theorem 4.1 can also be immediately transcribed to the homogeneous 
case using the variables (x\ u ].,x\ u> x'^ l> ,u G T) and we obtain the following 
result. 

Theorem 4.2. Let G be a homogeneous graph. Let D be in Qg, let (a, (5) 
be in A and (a' , (5') be in B. If (X, Sg) € Qg x Qg o,nd the joint distribution 

E [u> E<i>,tie:r) is W% G (a,(3,a)IWp G (o/,P',D), 
then the conditional distribution of £ knowing X = x is IWp G (a' — a, (3' — 
(3,D + x). 

We now have the following result dual to Theorem 4.1. 

Theorem 4.3. Let P be a perfect order of the cliques of G. Let a be 
in Qg- Let (a, ff) be in Bp and (a',/3') be in Ap. If the joint distribution 
of (y, 0) on Pq x Pg is Wp G (a, /3,9)IWq g {q! ', (3' ,cr), then the conditional 
distribution of® knowing Y =y is IWq g (o! — a, — (3,<p{y + o^ 1 )) . 
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Theorem 4.3 shows that the family of IWq g distributions is a conjugate 
family for the scale parameter 9 of the Wp G (a,(5,0). 

4.2. Markov properties. We now want to show that the Type II inverse 
Wishart IWp G is strong directed hyper Markov and the Type I Wishart weak 
hyper Markov. Let M(G) denote the set of all Markov probabilities over G. 
Let the distribution P p £ M(G) be parametrized by p. Now we randomize p 
according to a law C(p). For any subset A of V, let pa denote the parameter 
of the marginal distribution of Xa (more specifically, we should write p ~a p' 
if the marginal distributions of Xa under P p and P p > coincide and call pa 
the equivalence class of p for the equivalence relation The parameter 
Pa\b °f th e conditional distribution of Xa knowing Xp could be defined in 
a similar way. We say that C{p) is weak hyper Markov over G if under C(p), 
for any decomposition (A, B) of V, 

(4.3) p A II Pb\pahb- 

We say that C(p) is strong hyper Markov over G if, under C(p), for any 
decomposition (A, B) of V, 

(4-4) Pa \b 41 pb- 

Let P be any perfect order of the cliques and consider a perfect numbering 
of the vertices compatible with P (see [11], page 18). Let D be the directed 
graph obtained from G by directing all edges in G from the vertex with the 
smallest number to the vertex with the largest number. We say that a law 
£(p) is weak directed hyper Markov over D if for all v £ V, 

(4-5) p v -11 P pr (v)\Ppa(v), 

where pa{v) denotes the sets of parents of v in D and pr(v) denotes the sets 
of predecessors, that is, the vertices with a lower number than v. 

We say that C{p) is strong directed hyper Markov over D if for all v £V, 

(4-6) Pv\pa(v) 41 P pr (v)- 

Let us also recall that a random variable on is said to follow the 
Wishart w r (p,a) distribution if its density with respect to the Lebesgue 
measure is 

I ip-(r+l)/2 



\a\PT r {p) 

in which case its inverse U = X~ l is said to follow the inverse Wishart distri- 
bution iw r (p,9), where 9 = a -1 , with density with respect to the Lebesgue 
measure equal to 

|.-p-(r+i)/2 . , .. 
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Finally, we will use the notation XM2> and :rm. for 

x\i2> = x Cl \S2,s 2 x sl and x m- = x c 1 \s 2 -s 2 - 

Theorem 4.4. Let G be a decomposable graph G and let P be a perfect 
order of its cliques. Then, for (a, (3) £ Bp and for the direction given by P, 
the inverse Type II Wishart is strong directed hyper Markov. More precisely, 
if X ~ IWp G (a, fi,6) with (a, (3) £ Bp and 9 £ Qg, then, with the convention 
that si = S2, 

x [i\- ~ i^ci-si(-Oii,9[ii.), i = l,...,k, 

x [12> \ x [l}- ~ ^(ci— s 2 )xs 2 (0[12»20 <2 > 

x [i> x <)> \m- ~ N^ Sj)xaj (^b>^<i>' 20 <i> ® j = 2,...,k 

and 

(4.7) {(^[i2>,^[i]0^<2>, (z[j>^<}>,£[j].)> J = 2 ,---,k} 
are mutually independent. 

Proof. From (4.1), we know that Em.) is the parameter of 

the distribution of Zm given -Z<i> when Z ~ N r (0, E) €.Ng- Therefore it 
follows from the remark following Theorem 2.6 and Proposition 3.8 of [7] 
that to construct a weak hyper Markov distribution for p = T,g € Qg, it is 
sufficient to build a weak directed hyper Markov distribution for a given 
direction of the vertices compatible with a given perfect order of the cliques, 
that is, a distribution with density of the form 

p(E G ) =Pc 1 (Sc 1 )n^( S [*> S <i>' S [i]-l S <J>) 
i=2 

(4.8) =pi(S[ 12> ,S[ 1 ].|S < 2>)p<2>(S <2 >) 

fc 

X n^( S [i> S <*>' S [i]-l S <i>)- 
i=2 

If we want to show that, for the given direction, this distribution is in fact 
strong directed hyper Markov, by Proposition 3.13 of [7] it is sufficient to 
show that 

(4.9) {(E [12> , £[!].), E <2> , (E b> S- J 1 > ,S U] .), j = 2, . . . ,k} 

are mutually independent. Let us now show that the Inverse Type II Wishart 
satisfies both (4.8) and (4.9). Let X ~ IWp G (a,fl,9) with (a, (3) £ Bp and 
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9 € Qg- Combining (A. 9) and (A. 11) of the Appendix, we show that the 
image of the IWp G (a,(3, 9;dx) distribution by the change of variables (2.12) 
and (A. 5) is 

IWp G (a,f3,6; dx[i]., dx [12> , d(xg 2 ), d(x [j> x^ j> ), dxyy, j = 2,...,k) 

(4.10) oc e-^m-^W^^^.I^-^-^+iVs^^ |-«2/2 

x e -(( a; [i2>-f[i2>)^^ ] 1 (a;[i2>-f[i2>)f<2>> 

(4.11) x e -^<2>' e <2>)| x<2> r i+ ( Cl " S2 )/ 2+ ^-^ +1 )/ 2 (ix 



<2> 

k 

i a,- (a,- -s,- +11/2 -(xZt ,Bw\.) i 



(4.12) Jix^-^-^^e-^'^ix'-' \-'*/ 2 



3=2 

k 



(4.13) x Y[ e~^ Xb>x< ^ > ~ e[3>e<lj> ^ x ^ X[j>x<1 J > ~ ey>e< ^ > ^ <J> ^ 

3=2 

k 

x dx [12> dx m . J[ d{x^ > x'^ > ) dx {j] .. 

3=2 

We see that the densities (4.10), (4.11), (4.12) and (4.13) with x replaced 
by Eg are exactly of the form required for the respective factors of (4.8). It 
follows that the IWp G {a, (3,6) is weak directed hyper Markov, but we also 
see from (4.10)-(4.13) above that the independence in (4.9) is satisfied and 
therefore IWp G (a, (3,0) is strong directed hyper Markov. The densities of 

(»[!]., X[12>i x <2>, x \j]-> x [j> x <j>' 3 = 2, • • • , k) 

are also clearly as indicated in the theorem. □ 



This strong directed hyper Markov property of the Type II inverse Wishart 
corresponds to the strong hyper Markov property for the inverse G- Wishart, 
that is, the hyper inverse Wishart. We do not quite have the strong hyper 
Markov property because the parameters (a, (3) £ Bp are linked to the per- 
fect order P. The property analogous to the weak hyper Markov property 
of the hyper Wishart is given in the following theorem. 



Theorem 4.5. Let G be a decomposable graph and let P be a perfect 
order of its cliques. Then, for {a, (3) G Ap, the Type I Wishart is weak hyper 
Markov. More precisely, if X ~ WQ G (a, (3, a) with (a, (3) € Ap and a E Qg, 
then 



WISHARTS FOR DECOMPOSABLE GRAPHS 33 

X[i2>\x<2> ~ ^ r ( Cl - S2 )x S2 ( cr [l2> ) 2a; <2> 
x< 2 > ~w S2 (ai + <5 2 ,cr <2> ), 

x b> x<) > |a; <i> ~ JV(«y- Si )x« J -H>^>,2x^ > (8)<7[ J -].) J 

x [i\- ~ ^- Sj (ay - y> *[,•]•) , j = 2,...,k. 

Proof. Using (A. 4) and (A. 6) of the Appendix, we see that the image 
of the Wq g (cx, /3, a; dx) distribution by the change of variables (2.12) and 
(A.5) is 

Wq G {a,f3,a; dx^y, dx [l2> , dx <2> , dxyy, d(xu > x~ 1 j> ), j = 2, . . . ,k) 

(4.14) OC | a;[1] .|°i-^/2-(ci- S2 +l)/2 e -<a :il] . > a- 1 .> ^ 

(4.15) x ^ <2> |( c i-^)/2 e -((x [12> - CT[12> )x <2> (x <211 -<x <211 ) CT - ] 1 ) d:r[i2> 

(4.16) x |z <2> | ai+& - (s2+1)/ V<*< 2 >' CT <2>>dx <2> 

k -1 

(4.17) x n\x [jV \ a J- s > /2 - {c J- s > +1)/2 e- {x[ > ] -> a ^Ux [iy 

3=2 

x \ X<j> \te-'M 2 

U.IS) x e ~^ I ^> x <j>~ CT ^> CT <j> )x< J >(x <j> I <^ _CT <j> CT <j] )CT M 1> 

x d(xy > x < j > ). 

From the expression of the density Wq* (a,(3,a) above, we see that, for x = 
E G) (4.14) and (4.15) give ^(E^y^E^ 1 , E ClV?a . Sa |Esa) of (4.8) while 
(4.16) gives P5 2 (E S2 ) and (4.17) and (4.18) give ^(E^E"^, E [i] .|E <J> ). 
Therefore the WQ G (a,(3,a) Type I Wishart is weak directed hyper Markov 
and, by Proposition 3.8 of [7], weak hyper Markov. □ 

We note that in the proof above, the density pi(E[ C - 1 y5 2 5 2> E^ 2 1 , ^Ci\S2-S 2 |E,g 2 ) 
depends upon E^ 2 and the density p^E^E"^, E[jj. |E<j>) depends upon 
E <;7 > and therefore the WQ G (a,/3,a) Type I Wishart is not strong directed 
hyper Markov. 

4.3. Laplace transforms and expected values. For (a, (3) £ A, 3~( a ,p),i = 
{WQ G (a, P,a;dx),a € Qg} is the natural exponential family generated by 
the measure 

(a \ H c(a,f3;x) (A \ 

^a, P ),G{dx)= »c{dx). 
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For — y € Pq, the Laplace transform of H( a ,f3),G 1S 

JQg 

This is a reformulation of (3.19). It implies that, for — y + <r -1 £ Pq, the 
Laplace transform of WQ G (a, (3, a) is defined by 

(4.19) I 

Jqc H G (a,(3;a) 

Suppose that (a, (3) and (a 1 , (3') in A are such that (a + a', (3 + /?') is still in 
A. This is, for example, true for any G if {a, (3) and {a 1 ,(3') are in the same 
j4p and it is always true if G is homogeneous. We then have the convolution 
formula 

W Qg (a, (3, a) * W Qg (a', 0, a) = W Qg (a + a',(3 + 0', a), 

a result which would be difficult to prove using densities alone, that is, with- 
out Theorem 3.3. Let us also mention some properties of the NEF ^"(q,^),/. 
From Theorem 2.1(1), we deduce that L^, fl) G {y) is finite if and only if 
— y £ Pq- Since —Pq is an open subset of Zq (see [12]), F( a m^ is a regular 
family in the sense of [4] and the domain of the means Mf (a . : of the family 
^{a,/3),i coincides with the interior of the closed convex support of the family. 
Thus Mjr (a : = Qg- The cumulant function of fJ-^a^)^ m 

k k 

(4.20) /c M(Q ^ )G (y) = ^a J logdet((- ?/ - 1 ) C7j )-^/3 J logdet((-y- 1 )5 J ). 

j=i i=2 

The computation of its differential requires some care and it is done in the 
Appendix in Propositions A.l and A. 2. We give the result here. If X ~ 

W QG {a,(3,a), for y = cr~ l e P G , 

E W QG (a,f3,a)( X ) 

( 4 - 21 ) =^**w>. (v) 

k k 

= zZ a 3 T ( d v\c 3) c J (J c]) a c j - /Z ( ) 3 T ( d v\s 3) s^~sl) <T s ] , 

3=1 3=1 



where 



l Gi \f ac . 0\/I Cj o c )ra Ci y\ c . 

a v (■,■'■, a <\ T v\Cj J V °/ V I V \ C] 
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(4-22) 

= ( °Ci a c 3 y\c 3 \ 

\ °v\c 3 ,c } °v\c 3 °c] a c 3 y\c 3 J 
= a-a v \ Cj . Cj 

with a similar expression for T(ay\s j ,Sj 's^) a Sj- This implies that if X ~ 
WQ G (a,P,a), then 



E W QG ( a ,f3,a) 



/ k k \ k k 

\j=l j=2 J j=l 3=1 



We have parallel results for the Type II Wishart. We note that for (a, (3) £ B 

F(a,p),n = {W Pg (a, (3, 9; dy),eeQ G } 
is the natural exponential family generated by the measure 

v { a,p), G (dy)= rn(a>/3) Mdy). 

Here the Laplace transform L U{a G {x) is finite if and only if x is in —Qg 
(Theorem 2.1, part 1.), which is an open subset of Iq (see [12]), and the 
domain of the means of ^F a ,p,\i is Pg- We have L U{a „ G (x) = HG(a,(3] —x). 
This implies that the Laplace transform of Wp G (a, (3;6) is defined for — x + 
9 G Qg by 



(4.23) / e <*'f>Wp G (a,/?,Ml/) 



Ip g ~ " r8 ^ r '"'""' H G (a,f3;6) ' 
The cumulant transform is 

^(cw.g^) = ^ a j 1 °S det ((~ 2; )c J ) -Zl^ lo S det ((~ x )' 
and its differential is given by the following element of P G : 

k k 







(4.24) k f v (x) = £ ay(xa})° - £ 

3=1 3=2 

This implies that if Y ~ Wp G (a,/3,0), then 

(4.25) ^(^(Y) =E«i(-^) -Y,0j(- s})°- 

3=1 3=2 

To conclude this section, let us make a few remarks. Formula (4.25) with 
Y replaced by S^ 1 , (a,/3) replaced by (a' — a, (3' — /3) and # replaced by 
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D + x gives the posterior mean E(Yi~^}~\X = x) of the inverse of the natural 
parameter T>q for WQ G (a, (3, Y*q) in Theorem 4.1 when the prior distribu- 
tion on T,g is IWp G (a' , @',D). It is, of course, also of interest to compute 
the posterior mean of the natural parameter Y>q. In other words, we need 
Ewp G {a' -a,p' -p,D+x)(<P(Y)) when Y = S^ 1 . We have the general formula 

(4-26) +E(«J + 2L ^) E w PG ( a ,wM Y )v\c J -c ] ) 

k 

+ Sj ^)Ew PG { a AoM Y )v\s rSj )- 

3=2 V 7 

The proof is not straightforward. To derive (4.26), we use Stokes' formula 
and obtain 

0=/ (u'(y)v(y)+u(y)v'(y))dy 
Jp G 

with u(y) = H G (a + ^,/?+ ^,<p(y)), v(y) = e< e '«>, and where (a, (3) G 
S satisfy some restrictions similar to the restrictions for the existence of 
the expected value of the inverse Wishart, that is p > for the w r (p,a) 
distribution. Using the same substitutions for a, (5,9 in (4.26) as we did in 
(4.25), we see that (4.26) implies that in Theorem 4.1, the posterior mean 
of T,q is not linear in x; this is in accordance with Theorem 3 of [9]. We 
also see from Corollary 4.1 that when X = ir(S) E Qg follows the hyper 
Wishart distribution with (a, (3) = (§,§) and the shape parameters of the 
prior on S(j are (a',/?'), then the shape parameters of the posterior are 
(a' — — §)• That is, as for the inverse Wishart or the hyper inverse 
Wishart, the parameters (a 1 , j3') of the IWp G are added to half of the sample 
size and from (4.26) the choice of (a 1 , (3') has the same kind of impact on 
the posterior mean as the choice of the shape parameters for the inverse or 
hyper inverse Wishart. 

Let us also mention here that the IWp G (a, (3,6) distribution as given by 
equations (4.10)-(4.12) is conditionally (k + l)-reducible and is an enriched 
conjugate family of prior distributions, in the sense of [5], for the parameter 
Tiq = 7r(E) of a Gaussian distribution Markov with respect to G. This follows 
immediately from Theorem 4.4. The IWp* G {a,(3,0) is also closely linked to 
the enriched standard conjugate Wishart family of priors for K = in 
the standard Gaussian distribution, that is, when G is a complete graph, 
built by Consonni and Veronese [6]. Theorem 2 and Corollary 1 in that 
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paper correspond to Theorem 4.4 here. However, we should note that it is 
the Wp G (a, (3,9) family that is an exponential family, not the IWp G (a, (3,9) 
family and therefore the analog of the enriched Wishart, for G decomposable, 
is Wp G {a,(3,9). 

5. Open problems. We will now raise some natural questions related to 
the paper. 

Singularity. The well-known Gyndikin theorem states that the mapping 9 *— > 
(det(— 9))~ p from — to (0, oo) is the Laplace transform of some positive 
measure fi p on symmetric real matrices of order n if and only if p is in the 
set 

[I 2 r-n fr-1 \ 

— } u hr'°°> 

A very readable proof of this theorem can be found in [17]. The natural expo- 
nential family generated by fx p is the set of Wishart distributions with shape 
parameter p. If p = j/2 with j = 1, . . . ,n — 1, then \i p is concentrated on the 
singular semipositive definite matrices of rank j. For a decomposable graph 
G on V = {1, . . . , r} and for (a, (3) € A, the mapping y \— > Hc(a, (3; —ip(y)) 
from — Pq to (0, oo) is the Laplace transform of a positive measure on Qg 
which generates the natural exponential family of Wishart distributions of 
Type I. Natural questions are: 

• For which values of a, (3 is y i— > Hg(c(,(3; —(p(y)) the Laplace transform of 
some positive measure on Ig? 

• How do we describe these measures? 

Similar questions arise with the Wishart distributions of Type II: for which 
values of a, (3 is the mapping x ^> Hc(a, (3; —x) from —Qg to (0, oo) the 
Laplace transform of some positive measure on Zg? 

Complex and quaternionic numbers. Wishart matrices with complex and 
quaternionic entries are well defined. Thus many concepts of the present 
paper are extendable to complex or quaternionic matrices in a rather me- 
chanical way. 




Fig. 3. The case n = m = 3. 
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The sets A and B. Is it true that A = U Ap and that B = U Bp for any non- 
homogeneous graph? Calculations are terrifying for the graph A§ : • — • — • — 
• — •. On the other hand, this conjecture is easily proved for the tree repre- 
sented in Figure 3, with n + m + 2 vertices denoted ai, . . . ,a n , b\, . . . , b m ,2, 3 
with edges ai ~ 2, 6^ ~ 3 for all i,j and 2 ~ 3. To prove it, we need only 
extend the calculations of Proposition 3.2 and Corollary 3.1. 

APPENDIX 

A.l. Proofs of Section 2. 

Proof of Theorem 2.1(1). The dual of Pq isPg = {x G Ia;tr(xy) > 
for all y G Pg \ {0}} where Pq is the closure of Pg, that is, the cone of pos- 
itive semidefinite matrices of Zq. To show that Qg = P<j we wm show 
that Qg C P g and then that Qg D Pq. If x G Qg, then by Theorem 2.1 
there exists a symmetric positive definite matrix x which is the comple- 
tion of x. Thus tr(xy) = tr(xy) for all y G Zq. Furthermore, if y G Pg \ {0} 
then tv(xy) = tr((x) 1 / 2 y(£) 1 / 2 ). Since the matrix ((x) 1 / 2 y(i) 1 / 2 ) is positive 
semidefinite and nonzero, its trace is positive. Thus x G P G and Qg C Pq is 
proved. 

Conversely, take x £ Ig such that x G Pg. Fix a clique C and consider 
a vector u of M r such that the components of v which are not in C are 0. 
Denote by vc and by xc the restrictions of v to C and toCxC, respectively, 
and assume that v ^0 and thus vc ^ 0. Since vv t is in Pg \ {0} and since 
x G Pgr, < tr(xOT 4 ) = u'xv = v c xcvc- Moreover, this is true for any vc ^ 
and therefore xc is positive definite. Since this is true for all cliques, we 
deduce that x is in Qg and Qg D Pq is also proved. We thank S. Andersson 
for this result and this proof. Our former proof was longer and relied on the 
description of the extremal lines of the cones Pg and Qg given in [12] . □ 

A.2. Proofs of Section 3.3. 

Proof of Proposition 3.1. For convenience, we agree to write t for 
a vertex of T corresponding to a clique C while we write q for a vertex of 
T corresponding to a separator S. As defined in (3.10), m v = J2u-<v n u and 
for any v G T, we have 

\ x c\ = IIl x M-l' \ x s\ = IIl^M'l' \C\ = c = m t , \S\ = s = m q . 

This means that |xr u i.| will appear in \xq\ for any C such that u ^ t and in 
any jx^j for any S such that u<q. Therefore 
n|xc|at _ (c+1)/2 

Yl\ xs \"(q)(P q -{s+l)/2) 
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= n ix [u] .p«^ (Qt " (mi+i)/2) ~^^^ (9)(/39 " (m9+i)/2) 

(A.i) = [] | X[u] jS^ t ^-E^^(9)^)| JH j-(E^ t (^+ 1 )/ 2 -E^^(9)(^+i)/2) 
= jj| x [(E^t^-E^,"^)^ j-(E^W 2 +E^W 2 +(^+ 1 )/ 2 ) ) 

where the third equality above follows from the definition of m v and the 
fact that for any vertex q of the tree, v{q) is equal to the number of children 
of q minus 1 (see part 2 of Proposition 2.2). We now make the change of 
variables (3.14). The Jacobian of this change of variables is 

(\ n v 
nw 

Therefore we obtain the image of Hc(a, (3,x)(j,g(cIx) as 
H G (a,f3,x)fi* G l Y[dx [u] .d(x 

\u> x <u>) I 
\u£T / 



U\ x [t]- 



a t -(m t +l)/2 

(E^ t K?)&+£^ n «/ 2 "E^ u n,/2)-K+l)/2 



X |^[lt]-| "-'^ 

(A.3) Xds^.^^a;^) 

= n ix M .i ( ^^' a ^^^« iy(9)/39+ ^^ nu/2 "^^« n " /2) ~ {nu+i)/2 

tier 

= II k[«]-! A "" (nil+1)/2 ^M-^[«>^<i>) 5 



where 



Au = E^-E%)/3 9 + Ey-Ey- 

u^t u<q u<v v~<u LI 
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A. 3. Proofs of Section 3.4. In the sequel, in order to avoid numbering 
difficulties for separators with multiplicity greater than one, we sometimes 
use the generic notation S for a separator and v(S) for its multiplicity. 
However, when it is important to list the separators as they appear from a 
perfect order of the cliques, we denote the separator Sj by (J). The double 
notation should not cause any difficulty. 

Proof of Theorem 3.3. To avoid any ambiguity in the notation in 
the proof below and all the other proofs in the reminder of the paper, let us 
recall that the set S of distinct separators contains k' < k — 1 elements. For 
convenience, let us write A for the left-hand side of (3.19) and y = a~ l for 
a 6 Qg- Using the Jacobian (2.13) and formula (2.9), we then have 

r , „ T\ k i\xr | Q j-( c i +1 )/ 2 
a H SeS \ Xs \HSW(S)-(\S\+l)/2) ax 

\xc | ai- ( Cl+1 )/ 2 e~ <a:cri '' 7 ^ 
x TJ e -(( x b> x <]>- a y> a <j>)>%].( x ij> x <]>- (T ij> C7 <]>) x <j>} 

3=2 

TJ | X5 |E ie j(p, S) K-( c »+ 1 )/ 2 )-^ s ')(' 3 (- s )-(l- s l+ 1 )/ 2 ) 

k 

J] \x S \^ P ^ Cl ~ UiS)lSl dx Cl J] dix^X-^dxyy 



X 

Ses 



x 

ses j=2 



Now, since the cardinality of J(P, S) is equal to v(S) and, by assumption 
1 of the theorem, all \xsj\ = \x<j>\,j ^ 2, appear with exponent equal to 
Cj 2 Sj while \xs 2 \ = |^<2>| appears with exponent equal to J2ieJ(p,S2) ai ~ 
v(q)(3g + EieJ(P,s 2 ) £l ir-i we have 

| XCiri -(c 1 +l)/2 e -<xc 1 <)| x<2> |<5 2 
k 

(A.4) X TT| z<j> |fe- s 3)/ 2 e _ ^[.j>*<J>^ 



3=2 



X ^r^ 12 -^- 8 ^ 1 ^-^'^ dx Cl fl dix^x'^dxyy 

i=2 
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Clearly (x [k] ., x [fc> x~^ > ) is independent of x ClU ... uCfe _ 1 = (xc^x^x'^^^y, 

j = 2, . . . ,k — 1) and xnu. is independent of x^x"^. Therefore holding all 

other variables fixed, we first integrate with respect to (x^y, 

Since < k > C C\ U • • • U Cfc_i, then x<fc> is fixed and by Lemma 2.4 we 

obtain 

, ,a k -s k /2-(c k -s k +l)/2 e -{x [k] .,*^) . 

M + " 1 J 

c fc- s fe 



\ x <k>\ 

L(R 3 k,R c k- a k) 



(c k -s k )/2 



x e ~(( x [k>^J :> -('[k>(' < l > )^ [k l{x[k>x < 1 k> -^[k>^ < 1 k> )x<k>) 

x d(x [k> x-^ k> ) 

lm„ r s <=/ 2 lr , |( c fe- s fe)/ 2 

Ffc-I F<fc>l 

L{R a k,R c k- s k) 

x e -((^[fc>^<t>- CT [fe> CT <fe>).Vj 1 (^[fc>^<fc > -qfc>^<fc > )^<fc>> 
X <%fc>Z<fc>) 

= 7 r (Sfc(Cfc - Sfe))/2 r Cfe - Sfe (« fc -f)k w .r fc . 

Repeating this process successively for j = k — 1, . . . , 2, we obtain 

fc 



s 
2 



i=2 

x / |x Ci r i " (ci+1)/2 e- <a ' Cl < > |x <2> | <52 ( ixc 1 . 



In this last integral, setting, as in Section 4.2, 

x [l\- = Xd\S 2 ~ x C 1 \S2,S2 X S2 X S2,C 1 \S 2 i x [12> = x Ci\S 2 ,S2 X sl '' 
we make the change of variable 

(A.5) x Cl i-> (x [1 ].,X[ 12> ,x<2>) 

with Jacobian equal to |x < 2>| Cl_S2 . Then, by (2.11) 

|xCiri -( Cl+ l)/2 e -<x Cl ,<) |x<2>! 5 2dxCi 



(A.6) = /laj^.r 1 -' 8 / 2 -^ 1 -' 9 ^/^-^ 1 ]-'^^ 
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X / | x<2> | Ql+ ( Cl - S2 )/ 2 +' 5 2-(^+l)/2 e -<a;<2>,<Ts 2 1 > 



MJ 2 



e -<(-[12>-[l 2 >), CTm 1 (-[12>-[l 2 >)-<2>> dx j dx<2> 

L(R s 2,Rci- a 2) 

M, .ai - a (ci-a a )/2 



r ci _ S2 ( ai - — n s 



X / J%<2>\ 



a 1 -s 2 /2-(c 1 -S2+l)/2+c 1 -S2-{c 1 -S2)/2+52 



M+ 



xe- {x<2> ' a ^dx <2> 



iai 



x / |x <2> r i - (s2+1)/2+52 e- <a;<2> < > ( ix <2> 



m; 2 



(A.7) =7r^( Cl - S2 )/ 2 |cTc i r i k52| 52 r ci - S2 (a 1 - |)r s2 ( ai + fc). 

Now, let us observe that using the multiplicity of S 2 and assumption 1 of 
the theorem, we obtain 

A, .a, kc 2 r 2 A Kg 

,= 2 r^2l J=3 r^l 

(A.8) 

kc 2 r nUKi^ 



Combining (A. 5), (A.7) and (A.8), we obtain (3.19) with 
Ti(a,/3) = T S2 (a 1 + S 2 ) 

x 7r (l/2)(ci-a 2 )s 2r 



C1-S2 



S2 

«i-y 



To obtain (3.20), we use (2.16). □ 



Proof of Theorem 3.4. For convenience let us denote by B the left- 
hand side of (3.22). Using first (p(y) = i£ Qg and the Jacobian (2.4) and 
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then, making the change of variable (2.12) with Jacobian (2.13) and using 
(2.9), we have 

n _r - tf «-i> nj=iKr^ +1)/2 

ho* Uses \x S \^)WS)-(\s\+i)m ax 

, ^ |ai-(ci+l)/2 e -(*oJ' fl cri> 



xnis b1 r , '" (<y+1)/2 e~ <<e M 1 -'^-> 

x J| e -<( x b>^<}>- e [j> e <)>)^y ] 1 (^[ J >^<->-%>e< 3 1 > )e< J >> 

x j| | X5 |E ieJ( P,s)(^-( c >+ 1 )/ 2 )-^)(/ 3 ( s )-(l s l+ 1 )/ 2 ) 



n |x 5 i e ^w 5 ) ci "" (5)|5| ^ Ci 



X 

SeS 

A' 

X 



Y[ d ( x [j> x <j>) dx [j]-- 

3=1 



By assumption 1 of the theorem, this is equal to 

B= j \x Cl \ aiHci+1)/2 e~ {x ^' eCl) \x <2 >\^ 



(A.9) 



J'=2 



x e -(( x [ J > s <j>- e [j> e <j>)^[,] 1 ^b>^<}>- e [ J > e < 3 1 >) e <J>) d:z . c , i 

x n^[j>^<)>)^[j]-- 

i=2 

Clearly xc x , (xy > x'^j > ,xyy,j = 2, . . . , k) are mutually independent and 

k 

B = B l x l[Bj, 

i=2 



where 



B x = f |x Ct r-^ +1 )/ 2 e-^^>|x <2> rdx Cl 
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and, using Lemma 2.1 in the third equality below, we have 



B 



-(( x b> x < 1 j > - e lj> 8 < 1 3> )^i 1 U x b> x < 1 3> - e [j> d < 1 j > ) e <j>) 



"3 °3 



3 -/.\r .. \.//.../r ./,'' ~ 

x d(x\j > x < j > 



X X 



c: — s 



3 3 



^•jv>3—j>,' j |^ > |-(«y-»i)/2| a .-l|-ai+(c,-«j+l)/2-(c,-5 J +l) 



AT 



= ^ ( ^^l^>r fe - -i)/2 |e w .rrc,-. J (-a i ). 

Therefore 

(A.10) B=n^ J ^"'^l^>r fe "* ,) ^l%rr q ,_. J (-a i )xB 1 . 
To compute Bi, let us make the change of variable (A. 5). Then 

Bi = J \x[i] j ai -( ci+i )/ 2 e ~^ri] 1 ' [ i ]^ e ~^ x [ i2 >~ e [ i2 > ) ' x ri] 1(x [ i2 >~ £ '[ i2 > :)e ' <2>> 

(A.ll) X e -< CC <2 > ' 6, <2>> |rc <2> |«l-( c l~ s 2)/2-( S 2 + l)/2+T2 + (c 1 - S2 ) 

X d,X[iy dX[i 2> C?X<2>. 

Integrating with respect to x^ 2> and using Lemma 2.1, we obtain 

B 1 =ie <2> r (ci - aa)/ v ci - aa > a / 3 

c l- s 2 



X / e -( 2 -'<2>' e < 2 >} 

| 6 , <2> |-(ci-s 2 )/2 7r ( Cl - S2 ) S2 /2 

c l- s 2 
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(A.12) 



-{a: < 2 > ,6»<2>> 



Ml 



X \X 



-1 |-ai-(ci-sa)/2-72-(s 2 +l)/2 
<2>l 



7r (c 1 -s 2 )s 2 /2|^ 



<2> \-^-^ /2 \e [x] r 



xr cl _ S2 (-a 1 )|e <2> r i+(ci - S2)/2+ ^r, 



<2>y 



Cl - S 2 

~ a i n 72 



^ (ci - S2)S2/ ¥<2>r i+72 |0[i].r i r Cl - S2 (-a 1 )r S2 (-a 1 



ci - s 2 



72 



^(ci-S2W2|0 Ci j ai |0 



>r i r cl _ S2 (-ai)r S2 ( -ai - 1 2 - 72 



Let us now observe that 

k 



ni^>i 

i=2 



(A.13) 



n* =2 iflc 3 - g |g <J ->i-^-^-^)/ 2 n ge s i^r (s)/3(s) 



Yl ses \e s \^ms) 1W 



E 



a j +(l/2)( C ,- S ,)+^(S)/3(S) 



<2>| 



-72 



n se5 i^r (5)/3(5) 

Combining (A.10), (A.12) and (A.13), we obtain 

r (l/2)ac 1 -s 2 )s 2 +J2 h j=2 (cj-sj)s ] ) 



r n (a,/3) =7r v 



x r 



*2 



ci - s 2 
■«i n 72 



Cl— S2 V 



-«i)II r «y--i(- 



-a 7 



i=2 



To obtain (3.23), we use (2.17). □ 
A.4. Proofs of Section 4.3. 

Proposition A.l. For —y e M+ and C C {l,...,r} denote ac{y) 
((—y)~ 1 )c- We write y by blocks corresponding to C and its complement 

2/1 2/12 
2/21 2/2 
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We denote for simplicity y' 12 = 2/122/2 • With this notation the differential of 
y^a c (y) is 



h 



hi 
h2i 



hi2 
h 2 



[o-c 



(A.14) 

Furthermore, the differential of y 
h 



(A.15) 



tr 



tr 





hx2~ 




Mi 


h 2 _ 




- hx 


hx 2 ~ 




Mi 


h 2 _ 





-^CVX2\ 

o~c -o~cy'x2 
-y'ix°c y' 2 xo-cy'x2 

o-c 



'hx 


hx2~ 




o-c 


Mx 


h 2 _ 




-y' 2 i°~c_ 



log det ac (y) is 



°~c,v\c 
&v\c,c ^v\c,c°~c °~c,v\c 
-1 



where the last equality is due to the fact that y[ 2 — ~°-c G cy\c- 
Proof. We know that 

o-c{y) = = -{yx- yx 2 y 2 1 y 2 iT l - 

Let Mc and Mq denote the restrictions of M and M+ to the clique C. 
Then o~c{y) = a o b(y) where a:—M^ — > M^t is defined by a(x) = — x~ l and 
has differential h 1— ► a'(x)(h) = x~ l hx~ l (a linear application from Mc to 
Mc) and where 6:— M + — > — Mq is defined by b(y) = yx — yx 2 y 2 1 y 2 x- The 
differential of b is the following linear mapping from M to Mq- 



(A.16) 



hx 
h 2 x 



hx2 
h 2 



hx - h 12 y 2 l y2x - 2/122/2 l h 2 x + 2/122/2 1/l 22/2 1 2/2i 
[1 



-y'i2. 



'hx 


hx2~ 






Mi 


h 2 _ 




.-2/21. 



Finally we apply the composition of differentials to obtain 

a' c (y)(h) = (aoby(y)(h)=a'(b(y))(b'(y)(h))=ac(y)(b'(y)(h))a c (y), 

which gives (A.14) when combined with (A.16). Now consider the real func- 
tion I defined on M^t by l(x) = log det x. Then its differential is the linear 
form on Mc defined by h 1— > l'{x){h) = tr(x~ 1 /i). Thus the differential of the 
real function on M+ defined by / o cr c is the following linear form on M: 

h ^ (I o a c )'(y)(h) = ti a c l a' c {y){h), 

which gives (A.15) when combined with (A.14). □ 



We will now use the previous proposition to compute m = k'^(y). We need 
to introduce the notation he for the restriction of h E Zq to C x C, when 
C C V the notation C = V \ C and the notation hc,c' an d hc,c f° r the 
restrictions of h to C x C' and C x C, respectively. 
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Proposition A. 2. The differential of the real function y \— > kn(y) de- 
fined on Pq by (4.20) is the linear form on Zq defined by 

k 

h ^Y^ a i [toQiCj o-c, ) - 2 tx(hc> ,c, <rCi vk ,C ) + tr i h c' v'c' A a C j vk ,c )] 

„ J 3 J 3 J J 3 

k 

-zZPiMhSjVSj) - 2tr(/t S; 'S^SjVs^S') + tT ( h s'y's',s j (7 s j ys j ,s')} 

j=2 



yhvxc^Cj h V \c 3 J \o-v\Cjfij (J v\c h c ] o- c .o- { 



k 

k 

|^ tr \hv\ s . tS . h v \ Sj J \v v \ Sj , Sj avXSjftVs-VSiWi 

Proof. We apply (A. 15) to each term of the sum k^. The proposition 
has been established for the cone M+ and we apply it here to the restriction 
Pg = MjF n Zq of . Therefore the formulas for the differentials of func- 
tions restricted to this subspace are still in force when interpreted as linear 
applications defined on Zq- D 
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